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FOREWORD 


This  volume  contains  21  research  papers  dedicated  to  Milton  E.  Rose  on  the 
occasion  of  his  60th  birthday.  The  contributors  are  mathematicians  and  fluid 
dynamicists  who  have  known  and  worked  with  Milt  Rose  during  his  tenure  as 
Director  of  ICASE. 

These  research  papers  cover  some  recent  developments  in  numerical  analysis 
and  computational  fluid  dynamics.  Some  of  these  studies  are  of  a fundamental 
nature.  They  address  basic  issues  such  as  intermediate  boundary  conditions  for 
approximate  factorization  schemes,  existence  and  uniqueness  of  steady  states  for 
time-dependent  problems,  pitfalls  of  implicit  time  stepping,  etc.  The  other  stu- 
dies deal  with  modern  numerical  methods  such  as  total-variation-diminishing 
schemes,  higher  order  variants  of  vortex  and  particle  methods,  spectral  multi- 
domain  techniques,  and  front-tracking  techniques.  There  is  also  a paper  on  adap- 
tive grids.  The  fluid  dynamics  papers  treat  the  classical  problems  of  incompressi- 
ble flows  in  curved  pipes,  vortex  breakdown,  and  transonic  flows. 

The  editors  would  like  to  take  this  opportunity  to  thank  the  authors  for  their 
excellent  contributions  and  their  promptness  for  meeting  deadlines. 
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CONVERGENCE  TO  STEADY  STATE  OF  SOLUTIONS  OF  BURGERS'  EQUATION 


Gunllla  Kreiss 

Royal  Institute  of  Technology 
Stockholm,  Sweden 

and 

Heinz-Otto  Kreiss 

California  Institute  of  Technology 
Pasadena,  California 


Abstract 

Consider  the  initial-boundary  value  problem  for  Burgers'  equation.  It  is 
shown  that  its  solutions  converge,  in  time,  to  a unique  steady  state.  The 
speed  of  the  convergence  depends  on  the  boundary  conditions  and  can  be 
exponentially  slow.  Methods  to  speed  up  the  rate  of  convergence  are  also 
discussed. 


Research  was  partially  supported  by  the  Office  of  Naval  Research  under 
N00014-83-K-0422  and  National  Science  Foundation  Grant  DMS-8312264. 
Additional  support  was  provided  by  the  National  Aeronautics  and  Space 
Administration  under  NASA  Contract  No.  NAS1-17070  while  the  authors  were  in 
residence  at  the  Institute  for  Computer  Applications  in  Science  and 
Engineering,  NASA  Langley  Research  Center,  Hampton,  VA  23665-5225. 


1.  Introduction.  In  many  gasdynamical  problems  one  tries  to  calculate  the  steady  state 
solution  by  solving  the  corresponding  time  dependant  problem.  One  hopes  that  for  t -*  oo  the 
solution  converges  to  a unique  steady  state.  Recently,  M.  D.  Salas,  S.  Abarbanel  and  D.  Gottlieb 
[l]considered  the  initial-boundary  value  problem 


Ut  + |(< U2)*  = f(x), 


t>  0,  0 < X < 7T, 


u(x,  o)  = y(x). 


(1.1) 


They  used 

/(i)  = sinicosx,  g (a;)  = 6 sin  x,  0 < 6, 

and  showed  that  the  solution  u[x,t)  of  the  above  problem  converges  to  a steady  state  v(x),  as 
t —*  oo,  but  that  v(x ) depends  on  the  initial  data. 

In  this  paper  we  consider  the  viscous  problem 

ut  + - {n2)x  = euxx  + f(x),  t > 0,  0 < x < 1,  e > 0,  (1.2a) 

with  initial  and  boundary  conditions 


u(x,0)  =g{x), 

«(0,t)=a,  u(l,t)  = 6, 

and  the  corresponding  steady  state  problem 

2 {v2)x  — £Vxx  + f{x),  0 < x < 1,  e > 0, 

y(0)  = a,  y(l)  = b. 

For  simplicity  we  restrict  ourselves  to  two  cases: 


(1.26) 


(1.3) 


1)  a > 0 > 6,  a > —6,  f(x)  = 0, 

2)  a = b = 0,  f is  such  that  there  exists  an  a with  0 < a < 1 such  that  f(x)  > 0 for  0 < x < a, 
fix)  < 0 for  a < x < 1,  /( 0)  = /( 1)  = 0,  /,( 0)  > f0  > 0 and  /,( 1)  > /0. 


We  will  show  that  (1.3)  has  a unique  solution  and  discuss  the  properties  of  y(x).  We  shall 
also  show  that  in  all  cases  we  consider,  the  limit  of  y(x)  as  e — ► 0 exists.  Thus,  if 

tUm  u(x,  t)  = y(x) 
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exists,  we  obtain  a unique  steady  state  solution  of  the  inviscid  equation  (1.1)  if  we  first  let 
t — ► oo  and  then  e — * 0.  This  is  in  contrast  to  the  procedure  in  [ l],  where  the  two  limit 
procedures  are  taken  in  the  reverse  order. 

We  shall  prove  that  the  eigenvalues  of  the  eigenvalue  problem 

Ay?  = -(yph  + £(Pxx,  <p{  0)  = ^(1)  = 0,  (1.4) 

are  all  negative.  Therefore,  the  solution  of  (1.2)  converges  to  the  solution  of  (1.3)  provided 
u(x,  0)  = g(x)  is  sufficiently  close  to  y(x).  In  another  paper  we  shall  prove  that  u(x,  t)  converges 
to  y(x)  as  t — oo  for  arbitrary  initial  data.  The  speed  of  convergence  is  determined  by  the 
eigenvalues,  Ay,  of  (1.4).  We  shall  show  that  the  eigenvalue  distribution  depends  on  f{x)  and 
on  a,  6 in  the  following  way: 

There  is  a constant  c > 0 which  does  not  depend  on  e such  that 


(1)  if 

(2)  if 

(3)  if 

(4)  if 


a > —b,  f = 0 then  0 > —c/e  > Ai  > A2  > • • • 
a = —6,  / — 0 then  — Ai  = 0(e~1^e)  > 0,  —c/e  > A2  > A3  > • • • 
a = b = 0,  [ f(x)dx^  0,  then  — c > Ai  > A2  > • • • 

Jo 

a = 6 = 0,  f f(x)dx  = 0,  then  — Ai  = 0(e-1/f)  > 0,  — c > A2  > A3  > • • • 

Jo 


(1.5) 


We  expect  a reasonable  speed  of  convergence  in  the  first  and  third  case,  while  in  the  second 
and  fourth  case  the  speed  should  be  extremely  slow  due  to  the  eigenvalue  -Ai  = 0(e-1/r).  This 
is  confirmed  by  numerical  experiments.  We  see  that  at  first  u(x,  t)  quite  rapidly  approches  the 
same  limit  as  the  inviscid  equation  (1.1),  which  consists  of  solutions  of  the  stationary  equation 


i(«2),  = /(*) 


connected  by  a shock.  Once  the  viscous  shock  has  been  formed,  the  solution  of  (1.2)  becomes 
quasi-stationary  and  the  shock  creeps  extremely  slowly  to  the  “right”  position.  We  can  ex- 
plain the  behavior,  because  by  linearizing  around  the  quasistationary  solution  we  find  that  the 
eigenvalues  of  the  corresponding  eigenvalue  problem  have  a similar  distribution  as  earlier. 

If  — Ai  = 0(e-1/r)  then  the  speed  of  convergence  is  so  slow  that  the  above  method  to 
calculate  the  steady  state  is  impractical,  see  figures  (1)  and  (3).  However,  we  can  use  the  same 
technique  as  Hafez,  Parlette  and  Salas  in  [2]  to  speed  up  the  convergence.  See  figures  (2)  and 

(4). 

Unfortunatly,  not  only  the  speed  of  convergence  but  also  the  condition  number  of  the 
stationary  problem  deteriorates.  We  have  to  calculate  with  0(el/e)  decimals  to  obtain  correct 


results.  To  avoid  an  excessive  number  of  decimals  we  have  used  a quite  large  e in  our  numerical 
calculations. 

The  situation  becomes  much  better  in  a two  dimensional  case,  which  we  discuss  in  the  last 
section.  Now  there  is  a whole  sequence  of  eigenvalues 

~V\ j=0(j2e),  j = 1,2 

close  to  zero.  However,  they  are  only  algebraically  and  not  exponentially  close  to  zero.  We 
indicate  how  to  modify  the  procedure  to  accellerate  the  speed  of  convergence. 

We  believe  that  the  viscous  model  (1.2)  better  explains  what  happens  in  actual  calculations 
than  the  inviscid  equation  (1.1).  Practically  all  numerical  methods  have  some  viscosity  built  in. 
Also,  from  a physical  point  of  view,  the  solution  we  are  interested  in  is  the  limit  of  solutions  of 
a viscous  equation. 

Finally  we  want  to  point  out  that  the  appearance  of  small  eigenvalues  has  also  been 
observed  by  D.  Brown,  W.  Kath,  H.  O.  Kreiss  and  W.  Henshaw,  M.  Naughton  (private  com- 
munication). 


2.  Uniqueness, existence  and  properties  of  the  steady  state  solution.  We  start 
with  uniqueness,  which  can  be  proven  by  standard  techniques. 

Lemma  2.1.  If  the  steady  equation  (1.3)  has  a solution,  then  it  is  unique. 

Proof.  Let  u,  v be  two  solutions.  Then  w = it  — v is  the  solution  of 


~(piv)x  = ewxx,  p = u + v,  w(0)  =m(l)  =0. 


(2.1) 


If  w ^ 0 then  the  zeros  of  w are  isolated.  Let  x with  0 < x < 1 be  the  first  zero  to  the  right 
of  x = 0.  Without  restriction  we  can  assume  that  w > 0 for  0 < x < x,  i.e.  wx(0)  > 0 and 
wx(x)  < 0.  Integration  of  (2.1)  gives  us 


-eflu/sfy)!  + K(0)|)  = e[wx}x0  = i(ptu]g  = 0. 


Thus  w^O)  — wx(x)  = 0.  We  can  consider  (2.1)  as  an  initial  value  problem  with  initial  data 
w(0)  = ^2(0)  — 0 whose  solution  is  w(x)  = 0,  and  the  lemma  is  proved. 

We  shall  now  discuss  the  properties  of  the  solution.  Let  us  start  with  the  case  f(x)  = 0, 
a > 0 > b,  a > —b.  Integrating  (1.3)  gives  us 


0 < x < 1, 


y(0)  = a. 


(2.2) 
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The  constant  c has  to  be  determined  so  that  y(l)  = b.  We  necessarily  have  c = dP/2  > a2/ 2, 
because  with  c < ge2/2,  yx  > 0 for  all  x,  and  y(  1)  = b cannot  be  satisfied.  We  can  solve 
equation  (2.2)  explicitly.  This  is  done  by  writing  (2.2)  in  the  form 


2e 


»(*) 

/i 


dy 


y2  -d? 


dx, 


i.e. 


.a  + d.(y(x)  -d 
a — d 'y(x)  +d 


) = edx !• 


Therefore  y(l)  = 6 implies  d = a + 0(e  1/e),  and 


y(X)  - a1  + re-a(l-i)A’ 


with  r 


a — 6 
a + 6‘ 


(2.3) 


Away  from  the  boundary  layer  at  x=l  we  have  y(x)  = a + 0(e~a^l~x^e).  Thus,  for  e — ► 0,  y(x) 
converges  to  a for  0 < s < 1. 

If  a = —b  we  consider  (2.2)  on  the  interval  0 < x < with  boundary  conditions  y(0)  = 
a»  y( 5)  = 0 and  obtain  a solution  yi(x)  of  the  form  (2.3).  The  solution  on  the  whole  interval 
is  given  by 


y(z)  = 


( y i(*), 

\ —1/1(1 


if  0 < x < i, 

x),  if  \ < x < 1. 


In  figures  (9)  and  (10)  we  have  plotted  y(x)  for  two  different  sets  of  boundary  values. 

Consider  case  2,  where  / only  vanishes  at  x = 0,  a,  1 and  a = b = 0.  Without  restrictions 
we  can  assume  that 


j f{x)  > 0. 


0 


(2.4) 


If  this  is  not  true,  we  transform  the  problem  by  introducing  new  variables, 

x = 1 - x,  / = -/,  y = -y. 


The  new  problem  satisfies  (2.4). 

Lemma  2.2.  Let  y(x)  be  the  solution  of  (1.3),  F(x)  = f{£)d£  and 

Then 

y»(l)  < y»(0)  <KU  KX=  max  {|M*)|}  + 


h(x)  = s/W{x). 
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Proof.  Integration  of  (1.3)  gives 


e(y*  - y*(0))  = |y2  - F, 
y(o)  =0, 


(2.5) 


where  y*(0)  is  determined  by  y(l)  =0.  If  u = y - h,  then  u is  the  solution  of 


ux  = yx(0 ) — hx  + e luh  + ^-e  1u2, 

& 


u(0)  = 0. 


Assume  that  yx(0)  > K\.  It  follows  that  yx{0)  — hx(x)  is  positive  and  thus  u and  ux  are  positive 
for  all  x > 0.  In  particular  u(l)  > 0 and  y(l)  = u(l)  + h{l)  > 0,  which  contradicts  y(l)  = 0. 
Thus  yx(0)  < Kt.  Also 

ey*(l)  = ey*(0)  - F(l)  < ey*(0). 

This  proves  the  lemma. 

Lemma  2.3.  Let  y(x)  be  the  solution  of  (1.3)  and  let  e be  sufficiently  small.  If  F(l)  > 0 
then  y(x)  > 0 for  0 < x < 1 and  y(x)  has  exactly  one  maximum.  If  F(l)  = 0 then  there  exists 
an  x with  0 < x < 1 such  that  y(x)  > 0 for  0 < x < x,  and  y{x)  < 0 for  x < x < 1.  Also  y(x) 
has  exactly  one  minimum  and  one  maximum.  In  both  cases  \y{x)\  < max|F(x)|. 

Proof.  At  extrema  yx  = 0 and 


f C° 

Vxi  = £ 1 f = s = 0 

l>0 


for  0 < x < a, 
for  x — a, 
for  a < x < 1. 


(2.6) 


Thus  y cannot  have  a minimum  to  the  left  of  a maximum.  Since  y(0)  = y(l)  = 0 there  are  only 
three  possibilities,  namely 


y > 0 

for 

0 < x < 1, 

y has  exactly  one  maximum, 

(2.7a) 

y < 0 

for 

0 < x < 1, 

y has  exactly  one  minimum, 

(2.76) 

y > 0 

for 

0 < x < x, 

0 < x < 1, 

(2.7c) 

y < 0 

for 

X < x < 1, 

y has  exactly  one  maximum  and  one  minimum. 
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We  shall  prove  that  if  F(l)  > 0 then  (b)  and  (c)  are  not  possible,  and  that  if  F(l)  = 0 
then  (a)  and  (b)  are  not  possible. 

Let  F(  1)  > 0.  Suppose  (2.7b)  holds.  Then 


y*(0)<0,  y,(l)>0. 


By  (2.5) 


0<e(y*(l)-y*(0))=-F(l)<0.  (2.8) 

This  is  a contradiction,  so  (2.7b)  cannot  hold.  Now  supposse  (2.7c)  is  valid.  Then  yx(0)  > 0 
and  by  (2.8) 

y,(0)>  e^FU). 


If  e is  small  enough  this  is  impossible  by  lemma  2.2. 

Let  F(  1)  = 0.  Assume  that  (2.7a)  or  (2.7b)  are  valid.  By  (2.8)  yx{ 0)  = y*(l),  which  is 
only  possible  if  yx(0)  = yx{  1)  =0.  Differentiating  (1.3)  gives  us 


£yxxx  — yyx%  -t*  (yx)  fx • 


(2.9) 


Thus 


y(0)  = yj(0)  = yu(0)  = 0,  yII3(0)  < 0, 
y(l)  = yrr(l)  = y^(l)  = 0,  yxxx{\)  < 0. 

This  implies  that  y must  change  sign  at  least  once,  which  contradicts  the  assumption,  and 
therefore  (2.7c)  must  hold. 

It  remains  to  show  that  |y(x)|  is  bounded  by  max|F(x)|.  Since  y(0)  = y(l)  = 0,  the 
maximum  absolute  value  of  y is  found  at  a local  extrema,  where  yx  = 0.  Thus,  from  (2.5)  it 
follows  that 

|y(ar)|  < max  \F(x)  - ey^O)!  < max  |F(x)|. 


This  finishes  the  proof. 

We  can  use  the  usual  singular  perturbation  methods  to  discuss  the  behavior  of  the  solution 
in  detail,  see  for  ex.  [3]. 

Theorem  2.1.  Let  F(l)  > 0,  assume  that  (1.3)  has  a solution  and  that  e is  sufficiently 
small.  Then  y(x)  has  a boundary  layer  at  x = 1.  For  1 — 0(e|  log(e)|)  < x < 1,  y{x)  is  close  to 
tu(x)  which  is  the  solution  of 


-m, 


— oo  < x < 1,  tw(l)  = 0. 


(2.9) 


In  any  interval  0<a;ofs^<l  — 0(e|  log(e)|) 


y(x)  = h(x)  + (z,  e),  h(x)  = \j2F{x)  xg(x), 


(2.10) 
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where  ui  and  its  derivatives  are  bounded  independantly  of  s.  For  0 < x < x0  < a we  have 

y(s)  =h(x) +eu(5),  x = x/y/e , (2.n) 

where  u and  the  derivatives  dvujdxv  are  bounded  independantly  of  e.  Thus,  for  e -»  0,  y(x) 
converges  to  h(x)  for  0 < x < 1. 

Proof.  We  indicate  only  the  proof  of  (2.11).  In  the  proof  we  shall  use  /i,/2  and  I to 
denote  the  intervals  0 < x < 1,  1 < x < x0/y/E  and  0 < x < x0/y/i , respectively.  We  shall  also 
use 

11/11/  :=max|/(5)|, 

xEI 

where  I is  an  interval. 

We  introduce  a new  variable  in  (1.3), 

y(x)  = h(x)  + eu(x/s/e). 

This  gives  us 

u-xi  - {ig{x)  + y/iu)ux  - hxu  = -hxx,  0 < x < xq/ y/e,  u(0)=0,  u{x0/^)  = uo,  (2.12) 

where  uq  = «i(:ro,e)  is  bounded  independantly  of  e.  From  x0  < a and  the  assumtion  fx( 0)  > 
fo  > 0 it  follows  that  hx(x)  > ho  > 0 for  0 < x < xo.  Therefore  we  can  use  the  maximum 
principle.  The  maximum  of  u is  found  either  on  the  boundary  or  at  a local  extrema,  where 
u%  = 0.  At  local  extrema 

M < |^.|  < £||M*)II;  ~ 

Thus 

IMI/  < max(«o,a).  (2.13) 

Next  we  want  to  estimate  ||u£||/.  First  we  consider  the  interval  I\  = [0, 1].  By  (2.12)  and 
(2.13)  there  are  constants  C\  and  C2  such  that 

||u5i||/,  ^ C*i H^ill/i  + C2. 

It  is  well  known,  see  Landau  [4],  that  one  can  estimate  ||ui||/,  in  terms  of  |M|/,,  and  Hu^H/,, 
i.e.  for  every  constant  8 there  is  a constant  C(6)  such  that  ■ «> 

Itell/,  <$11^11/,  + C(£)|M|/,. 

Thus  for  5 = ^(Cj)"1  we  obtain  a bound  for  ||it55||A,  which  gives  us  a bound  for  ||u5|k. 
Especially,  |«i(l)|  is  bounded. 

In  the  remaining  interval  J2  = [l,x0/v^],  we  have 

F>F(y/i)=efx(0)(l  + O(y/Z)). 

Thus 

xg  + y/eu  = V2F/y/e  + y/e  > \Jfx(0)  + 0{y/e), 
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i.e.  for  sufficiently  small  y/e 


xg  + s/eu  > 0) 


At  local  extrema  of  ti^,  u^x  = 0 and  we  have,  by  (2.12), 


lu*l  - >//  (ofxx  hxti\  - yf(o)(lMk  + 11^11/2 11^11/2)  — : P- 


Thus 

IMI/a  <max(|u5(l)|,|u5(^)|,/?), 

V£ 

and  is  bounded  independantly  of  e in  the  whole  interval.  By  differentiating  (2.12)  bounds 
for  higher  derivatives  of  u can  be  obtained. 

It  is  also  clear  that  as  e — ► 0,  y(x)  converges  to  h(x).  This  finishes  the  proof. 

If  F(  1)  = 0 then  the  solution  switches  at  x from  \/2F  + O(e)  to  —y/2F  + 0(e).  In  each 
subinterval  0 < x < x and  x < x < 1 the  local  behavior  of  the  solution  is  of  the  same  type  as 
in  the  first  case.  As  e — ► 0,  y(x)  converges  to  h(x)  for  0 < x < x and  to  —h(x)  for  x < x < 1. 
In  general,  the  position  of  x can  only  be  obtained  by  detailed  calculation.  However,  if  f(x)  is 
antisymmetric  around  x = \ then  x = i.  This  is  the  only  case  we  consider. 

We  shall  now  discuss  the  existence  of  a solution.  For  this  we  need  two  lemmata. 

Lemma  2.4.  For  sufficiently  large  e the  steady  state  equation  (1.2)  has  a solution. 
Proof.  By  integrating  (1.3)  twice,  we  can  write  the  equation  in  the  form 


X X 

yfc)  = |»7 J y2(0d£ -v J + vxc0,  v = i/e, 

0 0 

1 1 

|/y2m-/f({)cl{+cl 0 = 0, 


or  after  the  change  of  variable  y — t]y 


y(x)  = \n2  f y2m-  j F(£)dt+xc0> 

0 0 

1 1 

^?2/  y2(m-f  W+co=o. 

0 0 
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For  7?  = 0 the  above  equations  have  a unique  solution.  Therefore  the  same  is  true  for  all 
sufficiently  small  r).  This  proves  the  lemma. 

Lemma  2.5.  Let  p(x)  be  a smooth  function.  Consider  the  eigenvalue  problem 

Ay?  = ~{p<p)x  + e<pxx,  <p{  0)  = <p{  1)  = 0.  (2.14) 

The  eigenvalues  are  real  and  negative. 

Proof.  We  introduce  a new  variable  xp(x)  by 


/ p( 

<p(x)  = e > t p(x), 


and  obtain 


\xfj  = ex pxx  - exp  =:  Lxp,  c(x)  = \px(x)  + -5-(p(x))2, 

£ 4 6 


xp{0)  =xP(l)  =0. 


(2.15) 


(2.15)  is  selfadjoint  and  therefore  the  eigenvalues  are  real.  Let  <p  £ 0,  A be  a solution  of  (2.14), 
and  let  x be  the  first  zero  of  <p  to  the  right  of  x = 0.  We  can  assume  that  <p  > 0 for  0 < x < x. 
Thus  <px{Qi)  > 0 and  <px{x)  < 0,  and  integration  of  (2.14)  gives  us 


X 


0 


<0. 


It  follows  that  A < 0.  If  A = 0,  the  only  possible  solution  of  (2.14)  would  be  <p(x)  = 0.  Thus 
A < 0,  which  proves  the  lemma. 

Now  we  can  prove 

Theorem  2.2.  The  equation  (1.3)  has  a unique  solution  for  all  e > 0. 

Proof.  We  have  already  shown  that  (1.3)  has  a solution  for  sufficiently  large  e.  We  will 
now  employ  continuation  in  e to  prove  existance  for  all  e > 0.  Assume  we  have  shown  existance 
for  e > e.  We  want  to  show  that  there  is  a solution  for  e = e.  By  lemma  2.3  the  solutions  of 
(1.3)  are  uniformly  bounded  for  e < e < e + 1.  Therefore  the  same  is  true  for  the  first  three 
derivatives.  Thus  we  can  select  a sequence  of  solutions 

y{x,e u),  v = l,2,...,  lim  ev  = e, 

V—*  oo 


such  that 


d’ 


d’ 


j =0,1,2 


dx} ' 
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and  y{x,e)  is  the  desired  solution.Linearizing  the  equation  around  y(x,e)  gives  us 

(y(®»e)^y)a  = e{Sy)xx  + (e-e)y(x,e),  Sy(  0)  = Sy(l)  = 0. 

By  the  previous  lemma  A = 0 is  not  an  eigenvalue  of  the  above  equation  and  therefore  we  can 
solve  (1.3)  for  all  sufficiently  small  e — e.  This  proves  the  theorem. 


3.  Speed  of  convergence.  In  this  section  we  want  to  discuss  the  speed  of  convergence 
to  steady  state.  We  assume  that  the  initial  data  g(x)  of  (1.3)  are  sufficiently  close  to  the 
solution  of  the  steady  problem,  so  that  we  only  have  to  discuss  the  behavior  of  the  solutions  of 
the  linearized  equation 


wt  + {yw)x  ~ ewxx,  0 < x < 1,  £ > 0, 

u;(r,0)  = j(x),  (3.1) 

w(0,  t)  = tu(l,  t)  = 0. 


To  determine  the  speed  of  convergence  we  study  the  distribution  of  eigenvalues  of 

+ ( y<p)x  = e<Pxx , <p(0)  = <p(  1)  = o.  (3.2) 

Theorem  3.1  . The  eigenvalues  of  (3.2)  are  real  and  negative  and  their  distribution  is 
given  by  (1.5). 

Proof . Lemma  2.5  tells  us  that  the  eigenvalues  are  real  and  negative.  First  we  consider 
the  case  / = 0,  a > —b.  We  write  (3.2)  in  the  selfadjoint  form  (2.15)  with  p = y.  Let  A = Ax 
be  the  largest  eigenvalue.  The  corresponding  eigenfunction  xpi  does  not  change  sign,  and  we 
can  assume  that  ipi  > 0 for  0 < x < 1 and  that  max  |V>i(a:)|  = 1.  We  assume  that  Ax  > —a2/ 8e. 
Then  there  is  a constant  K such  that  c(i)  + Ax  > 0 for  0 > x > 1 — Ke.  Thus  V’l  is  monotone 
in  the  interval  0 < x < 1 — Ke,  and  therefore  rpi  must  have  its  maximum  in  the  remaining 
interval,  1 — Ke  < x < 1.  By  assumption  mzxxpi(x)  = 1 and  therefore  there  must  be  a constant 
6 > 0 such  that  V’ii(l)  < —S/e.  Now  consider  the  corresponding  eigenfunction 


<pi(x)  = e • ipi (r),  <pix{±)  = tMl),  0 < <px(x)  < ^(x). 

Integrating  (3.2)  gives  us 


—S  > e(<Pla:(l)  - <Px*(0))  = A if  <pidx  > 

Jo 

>Ax  f1  e*'~lfi'yd*dx  = \1ed. 

Jo 
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Thus 


Ai  < 


. ,<?  i. 

mn,S’Zh 


and  the  theorem  is  proven  for  this  case. 

When  / ^ 0,  a = b = 0,  and  /q1  f(x)dx  > 0 the  corresponding  estimate  follows  in  the  same 
way,  since  by  theorem  2. 1 there  are  constants  Co  > 0 and  K such  that 

c(x)  = ^(hx(x)  + 0(y/s)  + + 0{yje))2)  > Co  > 0 for  0 < x < 1 — Ke. 

We  now  consider  the  antisymmetric  case  when  a = —6,  / = 0ora=6  = 0 and  f(x)  is 
antisymmetric  around  x = We  want  to  show  that 


— Ai  =0(e-1e"1/c). 

We  shall  use  the  fact  that  for  our  selfadjoint  eigenvalue  problem  (2.15)  the  eigenvalue  with  the 
smallest  absolute  value,  Ai,  satisfies 


|Ai|  < 


Wh 
Uh  ’ 


for  any  smooth  function  4>  ^ 0 satisfying  the  boundary  conditions.  We  chose 


* 1/2 

f y(()d( 

4>{x)  = e 1/2  — e 0 

as  trial  function.  y(x)  is  antisymmetric  around  x = and  <£(0)  = <£(1)  = 0.  Also 


1/2 


T i <y2  ,y*\ 
L*  = (-Te+J)e 


-i*-'  / 


Both  <p2  and  (|e  ly2  + ^yx)2  are  symmetric  around  x = Therefore 


1/2  2 

IIWIP  = 2 /(fj 


0 


+ 


1/2 

u\\2  = 2 j 2ydHe^~l^y^-ifdx, 


o 
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and  by  (2.5)  and  theorem  2.1 


2<jMjf 
1 - IMP 


1/2  2 

/ (£  + *t)2dx 

J ; < C2e-2e~2D/r, 


1/2  5*~l  f yd( 
f (e  0 — l)2d:r 

o 


where  C > 0 , D > 0 are  constants  which  do  not  depend  on  e. 

We  shall  now  estimate  the  size  of  the  second  eigenvalue  for  the  case  with  an  interior 
boundary  layer  at  x = |.  By  assumption  y{x)  is  antisymmetric  around  x = A.  Consider  the 
eigenvalue  problem  (3.2)  on  half  the  interval,  0 < x < A,  and  denote  its  solutions  by 


V^>»(2'))  Aj,  i — 1,2...  . 

We  know  that  <pi  has  * - 1 sign  changes,  and  we  have  already  shown  how  the  Vs  are  bounded 
away  from  zero.  The  function 


<P2i{x)  = 


j <Pi(x)  for  0 < x < A 

\ —<Pi(x  — A)  for  A < x < 1 ’ 


i = 1,2.. 


> 


will  satisfy  (3.2)  on  the  full  interval,  0 < x < 1 with  A = A2{  = V Also  y>2\  changes  sign 
2(i  - 1)  + 1 times.  Thus  <p2t  is  the  2 Ith  eigenfunction  and  A2,  is  the  2ith  eigenvalue.  Therefore 
A2  is  bounded  away  from  zero.  This  finishes  the  proof. 


4.  Numerical  results.  We  shall  discuss  difference  approximations  for  the  time  depen- 
dant problem  (1.2)  and  the  eigenvalue  problem  (3.2).  We  introduce  gridpoints 

(xi  = ih,t,=jk),  i = 0,1,...  y = 0, 1 N,  h = l 

where  N is  a natural  number  and  Ar  > 0 is  the  time  step.  Wc  also  introduce  gridfunctions 

=u(xi,tj). 

We  approximate  (1.2)  by  the  usual  implicit  method 


(/  - ekD+D-)ul+l  + ^£>o(ufH)2  = + kfit  i = 1, 2, . . . , N - 1 


(4.1) 


with  initial  and  boundary  conditions 

= (ft,  i — 1|  2, . . . , i\T  1, 

«o  = = 3 ~ 1»  2, . • • 

Here 

h2 D+D-Ui=  «,+i  - 2u,-  + «j_i  and  2hD0(ut)2  = (u,+1)2  — («,-i)2 

denote  the  usual  centered  difference  operators.  At  every  time  step  one  has  to  solve  a nonlinear 
system  to  determine  t^+1  . This  is  done  by  the  iteration 

{I-ekD+D-)uf+1)  =~kDo{u^)2  + t4+kfi,  1 = 0,1...  , (4.2) 

where  is  choosen  by  a predictor  process. 

In  all  our  experiments  the  solution  of  (4.1)  converges  to  a steady  state  solution.  However, 
the  speed  of  convergence  depends  on  the  location  of  the  shock.  If  the  shock  is  located  at  the 
boundary,  corresponding  to  the  first  and  third  case  of  (1.5),  then  the  convergence  to  steady 
state  is  quite  rapid.  See  figure  (5).  If  on  the  other  hand  the  shock  is  located  in  the  interior, 
corresponding  to  the  other  cases  of  (1.5),  the  convergence  is,  in  general,  very  slow.  When  the 
shock  is  formed  at  an  early  stage  it  is  in  general  in  the  “wrong”  place,  depending  on  the  initial 
data.  From  then  on,  the  the  shock  moves  slowly  to  the  correct  position.  See  figures  (1),(3).  This 
process  can  be  considered  quasi-stationary,  which  makes  it  possible  to  use  the  same  convergence 
acceleration  as  in  [2]. 

Formally  we  can  write  our  iteration  (4.1)  as 

H(un+1 ) = u"+1  - u"  :=  r".  (4.3) 

We  can  linearize  the  realation  and  obtain 

(/  — L)rn+1  = rn.  (4.4) 

In  our  case 

Lr{  = ekD+D-n  - kD0{u?+lri).  (4.5) 

This  is  a discretization  of  the  right  hand  side  of  the  eigenvalue  problem  (2.14),  with  p = un.  If 
the  process  is  quasi-stationary  we  can  consider  L to  be  independant  of  n.  Then  we  have 

r"+;  = (/  - L)-’rn 

and 

p-i 

un+p  = un  + £(J  - L)~3rn. 

y=o 

If  the  eigenvalues  A,-,  of  L are  negative  the  eigenvalues  k,,  of  (I  — L)-1  satisfy  |/c,|  < 1 and 

lim  un+p  =«"  + (/-(/- L)-1)"1^  = un  + (I-  L~l)rn  (4.6) 

p— ►CO 
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Instead  of  taking  a large  number  of  time  steps  we  can  take  one  large  step,  which  we  call  an 
extrapolation  step.  We  put 

« = «n  + /?e,  (4.7) 

where  e is  the  solution  of  the  equation 

Le={L-  I)rn,  (4.8) 

and  0 is  a stabilizing  parameter.  We  choose  /?  in  such  a way  that  H(un+0e)  has  no  component 
in  the  direction  of  e,  i.e. 

(H(un+/3e),e)=0, 

where  (•,  •)  denotes  the  usual  inner  product.  There  are  other  possible  choices,  for  example 
choose  /?  such  that 

\\H(un+/3e)\\  = min\\H(un+0e)\\. 

P 

Of  course  (4.7)  is  not  the  steady  solution  we  are  seeking.  We  use  the  new  u to  restart  the  time 
iteration,  and  make  a new  extrapolation  step  once  a new  quasi-stationary  state  is  reached.  In 
our  experiments  we  use  an  a priori  fixed  number  of  time  steps  between  the  extrapolation  steps. 
Better  strategies  are  under  development. 

We  have  calculated  the  first  eigenvalues  and  eigenvectors  of  the  discrete  linearized  operator 
(4.5),  provided  is  the  discrete  steady  state  solution.  The  calculations  show  that  the 
eigenvalues  are  negative  and  their  distribution  is  of  the  same  type  as  for  the  corresponding 
continous  case.  See  table  (1).  In  figures  (6), (7)  the  first  few  eigenvectors  are  plotted.  Note  that 
in  the  case  of  an  interior  shock  the  first  eigenvector  is  exponentially  small  away  from  the  shock 
region.  Also,  we  have  no  doubt,  and  it  is  confirmed  by  the  calculations,  that  the  position  of  the 
shock  does  not  change  the  nature  of  the  eigenvalue  distribution.  In  fact,  in  the  proof  of  theorem 
31,  y can  be  replaced  by  any  function  of  the  same  structure. 

In  our  case,  when  the  shock  is  located  in  the  interior,  (7  — L)  ^ has  only  one  eigenvalue, 
ku  close  to  zero.  All  other  eigenvalues  are  small.  Therefore,  when  we  have  reached  the  quasi- 
stationaiy  state,  rn  is  in  the  direction  of  the  eigenvector  corresponding  to  Ki.  See  figure  (8). 
Therefore  we  do  not  need  to  solve  (4.8),  and  instead  of  (4.7)  we  use 


tx  = txn  + fir”.  (4.9) 

In  figures  (2), (4)  we  have  plotted  u at  different  time  stages  to  show  how  the  convergence  is 
accelerated. 
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5.  A twodimensional  case.  Consider  the  following  problem 


«t  + (^«2)x  = s{ux x + tiyy),  0<x<l,  0 <y  < l,  t>0, 

«(0.y,t)  = a,  ti(l,y,t)  = -a,  a > 0,  (5.1) 

u(x,0,  t)  = u(a;,l,  t ) = to(a:), 
u{x,y,0)  -g(x,y), 

where  W'(ar)  is  the  solution  of  the  one  dimensional  problem  (1.3)  with  6 = —a,  and  f(x)  = 0. 
See  (2.3).  A steady  solution  of  (5.1)  is  u(x,y)  = w(x). 

The  speed  of  convergence  can  be  studied  by  analyzing  the  corresponding  eigenvalue  prob- 
lem 

(iip  + ( tv<p)x  = e(<Pxx  + <Pyy),  <p  = 0 on  the  boundary.  (5.2) 

We  can  solve  (5.2)  by  separation  of  variables.  Let  <p(x,  y)  = X(x)Y(y).  Then 

(wX)'-eX"  = \X,  X(0)=X(1)=0,  (5.3a) 

Y"  = -qY,  K(0)  = T(l)  = 0,  (5.36) 

with  fi  = \ — eq.  We  recognize  (5.3a)  as  (3.2).  Therefore  -Ax  = 0(e-1/e)  and  -Ay  > 0(l/e), 
j = 2,3, . . ..  We  can  solve  (5.3b).  The  solution  is 

Yj(y)  = s«n  tiny),  qj  = {jirf,  j=  1,2.... 

There  is  a whole  sequence  of  eigenvalues,  fxXj,  of  order  0(e).  The  eigenfunctions  corresponding 
to  this  sequence,  <pi}-,  will  be  exponentially  small  away  from  the  shock.  All  other  eigenvalues 
will  be  of  order  0(l/e). 

We  expect  that  the  time  iteration  will  again  lead  to  a quasi-stationary  state,  and  that 
the  residual  will  be  composed  of  eigenfunctions  corresponding  to  the  eigenvalues  of  order  O(e). 
Therefore  e in  (4.8)  will  be  of  the  same  form,  and  we  can  replace  all  components  of  e away  from 
the  shock  by  zero,  thus  obtaining  a linear  system  of  equations  of  order  N instead  of  IV2.  More 
details  will  be  given  in  another  paper. 
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Tbble  1, 

Eigenvalues  of  the  eigen\ralueproblem  (3.2),  y is  the  solution  of  (1.3).  Three  different  cases  were  treated . 
The  discretization  is  done  according  to  (4.5),  with  N = 100  gridpoints.  The  eigenvalues  were  found  using 
inverse  iteration.  Eigenvectors  corresponding  to  case  (l)  are  plotted  in  figure  (6a, b). 


^1 

a2 

*3 

/(x)  = sm(27rx)/2 
a = b = 0 
e = 0.04 

-8.64  • 10~3 

-4.34 

-5.32 

/(x)  = sin(27rx)/2 
a-b  = 0 
e = 0.02 

-4.62  • 10~6 

-5.617 

-5.622 

f(x)  = 0 
o = l,  b = -l 
£ = 0.02 

-1.24 -10-9 

-12.8 

-13.5 

18 


0 


25 


50 


75 


1.0 


Figure  1.  Convergence  in  time  without  convergence  acceleration.  Numerical  solutions  at 
different  time  stages  for  the  case  e = 0.05,  / = 0,  a = 1,  b — —1,  u(x,  0)  = 1 + 2(e“2x  — l)/(l  — e~2). 
Between  each  curve  there  are  200  time  steps  = 40  time  units.  The  calculation  is  made  with  time  step  k 
— 0.2  and  N=50  grid  points. 
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Figure  2.  Convergence  in  time  with  convergence  acceleration.  Numerical  solutions  at  different 
time  stages  for  the  same  case  as  in  figure  1.  Between  each  curve  there  are  15  time  steps  and  one 
extrapolation  step.  The  same  time  step,  k=0.2,  and  number  of  grid  points  , N=50,  are  used. 
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Figure  3.  Convergence  in  time  without  convergence  acceleration.  Numerical  solutions  at 
different  time  stages  for  the  case  e = 0.04,  f = \ sin(Trx)  cos(Trx),  a = b = 0,  u(*,0)  = isin(Trx). 
Between  each  curve  there  are  100  time  steps  . The  calculation  is  made  with  time  step  k = 0.1  and  N=50 
grid  points. 
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Figure  4.  Convergence  in  time  with  convergence  acceleration.  Numerical  solutions  at  different 
time  stages  for  the  same  case  as  in  figure  3.  Between  each  curve  there  are  20  time  steps  and  one 
extrapolation  step.  The  same  time  step,  k=0.1,  and  number  of  grid  points  , N=50,  are  used. 
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Figure  5.  Convergence  when  the  shock  is  located  at  the  boundary.  Here  e = 0.04,  f(x)  = 
f sin(Trx),  u(x,  0)  = ^ sin(7ri),  N = 50,  k = 0.1.  Between  each  curve  there  are  5 time  steps. 


23 


0 .25  . 50  . 75  1.0 


Figure  6a.  Eigenvectors.  The  first  two  eigenfunctions  of  problem  (3.2),  when  y , the  solution  of  (1.3), 
has  a shock  in  the  interior.  In  this  case  e = 0.04,  f{x)  = - sin(7rx)  cos(ttx),  a = b = 0,  N = 100. 
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Figure  7.  Eigenvectors.  The  first  two  eigenvectors,  <p\  and  (p2,  of  problem  (3.2), 'when  y,  the  solution 
of  (1.3),  has  a shock  x = 1.  In  this  case  e = 0.08,  f(x)  — j sin(Trx),  a = b = 0,  iV  = 100. 
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Figure  8.  Differences  between  consecutive  solutions  at  different  time  stages,  when  e = 0.0 
/ = ^ s\n(nx)  co$(nx),  a = b = 0,  u(x,0)  = |sin(7r:r).  Between  each  curve  there  are  100  time  steps 
The  calculation  is  made  with  time  step  k = 0.1  and  N=50  grid  points. 


Figure  9.  The  solution  of  (1.2)  when  / = 0,  a = 1,6  = 0 and  t = 0.05. 
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Abstract 

The  paper  discusses  the  role  of  the  intermediate  boundary  condition  in 
the  AF2  scheme  used  by  Holst  for  simulation  of  the  transonic  full  potential 
equation.  We  show  that  the  treatment  suggested  by  Holst  led  to  a restriction 
on  the  time  step  and  suggest  ways  to  overcome  this  restriction.  The 
discussion  is  based  on  the  theory  developed  by  Gustafsson,  Kreiss,  and 
Sundstrom  and  also  on  the  von  Neumann  method. 
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INTRODUCTION 


Approximate  factorization  schemes  are  widely  used  to  obtain  efficient 
solutions  to  problems  in  Computational  Fluid  Dynamics.  In  many  cases, 
they  have  provided  a significant  increase  in  efficiency  over  previously-used 
solution  methods  in  particular  problems.  Some  outstanding  examples  are  the 
classical  Alternating-Direction-Implicit  method  of  Peaceman  and  Rachford  [1], 
the  Briley-McDonald  Linearized  Block  Implicit  scheme  [2],  and  the  Beam  and 
Warming  [3]  Approximate  Factorization  (AF)  scheme  for  the  compressible  Navier- 
Stokes  equations.  In  the  transonic  potential-flow  area,  some  AF  schemes  which 
have  significantly  improved  solution  efficiency  are  the  work  of  Ballhaus  and 
Steger  [4],  Ballhaus  et  al.  [5],  Holst  [6],  [7],  and  Jameson  [8]. 

All  of  these  schemes  have  the  common  feature  that  the  solution  procedure 
is  broken  down  into  a sequence  of  easily-implemented  stages;  i.e.,  easily- 
inverted  matrix  factors.  Each  of  the  stages  usually  requires  boundary 
conditions  for  an  "intermediate"  variable  (vector)  which  is  not  always  a 
consistent  approximation  to  the  solution  function  desired.  This  feature  can 
make  satisfaction  of  Implicit  boundary  conditions  difficult,  at  best,  and 
impossible,  at  worst.  Dwoyer  and  Thames  [9]  demonstrated  serious  boundary- 
condition  problems  associated  with  the  class  of  AF  schemes  called  "Locally 
One-Dimensional,"  even  in  explicit  schemes. 

The  present  paper  further  highlights  the  importance  of  intermediate 
boundary  conditions  by  focusing  on  a specific  example — a boundary-induced 
stability  restriction  in  Holst's  AF2  scheme  [6]  for  the  transonic  full- 
potential  equation.  An  analysis  of  the  effect  of  the  intermediate  boundary 
condition  is  given  by  use  of  the  usual  von  Neumann  method  and  also  the  methods 
of  Gustaffson,  Kreiss,  and  Sundstrom  [10]  and  Osher  [11]. 
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ANALYSIS 


Holst^s  scheme  is  a variation  of  the  AF2  schemes  presented  in  References 
4 and  5,  It  will  be  referred  to  herein  as  ,,AF2Y,n  since  in  its  implementation 
the  y-operator  is  split,  rather  than  splitting  the  x-operator  as  in  References 
4 and  5.  For  the  purpose  of  analyzing  the  intermediate  boundary-condition 
problem,  it  is  illuminating  to  study  the  application  of  AF2Y  to  the  two- 
dimensional  (2-D)  Laplace^s  equation  in  a rectangle.  The  present  analysis  is 
valid  only  for  the  subsonic  flow  condition,  which  is  simpler  by  far  than  the 
transonic  case.  However,  it  is  reasonable  to  assume  that  if  boundary-induced 
instability  is  present  in  the  subsonic  case,  it  will  also  occur  in  the 
transonic  case.  In  practice  this  was  true. 


The  Discrete  Problem 

The  following  thin-airfoil  problem  is  thus  considered:  We  wish  to  solve 

the  Laplace  difference  equation  for  the  disturbance  velocity  potential 


L*jk  = (a6xx  + 


b5  )$.,  = 0 

yy 


(i) 


where  a and  b are  constant  coefficients  and 
difference  operators;  e.g., 


5 

xx 


and  6 are  central 

yy 


5xx^jk  “ ^j+1 ,k 


+ <f> 


(2) 


The  boundary  conditions  are  set  on  a rectangular  region  with  Dirichlet 
conditions,  <f>  = 0,  set  on  three  sides  (left,  top,  and  right),  representing 
vanishing  disturbances,  and  a Neumann  condition  at  the  bottom  boundary, 
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representing  a thin-airfoil  f low-tangency  condition: 


4>y  = s(x) 


at  y = 0. 


(3) 


A discrete  analog  of  Eq.  (3)  at  k = 1 can  be  written  as: 


( t +$)*.,“  2Ays(x) 

y y 


(4) 


where  we  use  the  following  notation  for  one-sided,  two-point  differences: 


Vjk  *j,k+l  " ^jk 

(5) 

Vjk  = +jk  " ♦j.k-r 

(6) 

The  difference  operator  (1)  requires  evaluation  of  6 1 at  t*ie  boundary 

k = 1.  Since  this  operator  can  be  written  as: 


6 


yy 


(7) 


Equation  (4)  is  used  to  eliminate  ^y'l’j  which  calls  for  a value  of 

below  the  boundary  k = 1.  Thus,  the  difference  operator  at  k = 1 is: 


*jk 


= (a6xx  + - 2bAys(x). 


(8) 
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The  AF2Y  Scheme 


The  AF2Y  scheme  models  a hyperbolic  equation, 
an  iteration  scheme: 


and  is  used  as 


(a  + b j$y)(-  ab2£y  - a6xx)A«)»jk  = acoL^  (9) 

where  n is  the  iteration  counter, 


blb2  = 


(10) 


and  A<J>  is  the  correction 


AV  = *jk  -*Jk- 

The  scheme  is  implemented  in  two  stages: 

(a  + b1Xy)fjk  = oa»L^k  (12) 

(-  cb2Sy  - - fJk.  (13) 

The  intermediate  variable  f is  defined  by  Eq.  (13).  The  parameter  a 

corresponds  to  a reciprocal  "time"  step,  At"1,  and  is  usually  cycled  between 
small  and  large  values  to  obtain  rapid  convergence.  The  parameter  w 

corresponds  roughly  to  a relaxation  factor  which  is  usually  close  to  2. 

The  first  stage  (12)  is  bidiagonal,  proceeding  from  the  bottom  boundary, 
k = 1,  to  the  last  interior  row  of  mesh  points,  k = K - 1,  for  every  j.  The 
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second  stage  (13)  is  a tridiagonal  solution  which  proceeds  row-by-row,  from 

k * K - 1 to  k = 1,  to  obtain  the  correction  The  second  stage  is 

initiated  with  the  condition  Ad>.  „ = 0,  corresponding  to  the  vanishing 

J >K- 

disturbance,  cf>  = 0,  at  k = K. 


The  Intermediate  Boundary  Condition 

The  main  problem  in  implementing  the  scheme  is  how  to  initiate  the 
bidiagonal  solution  for  f at  k = 1.  It  seems  reasonable,  at  first  sight, 
to  use  a derivative  condition  on  f at  the  boundary,  as  Holst  [6]  did;  i.e., 


= 0. 


Comparison  of  Eqs.  (14)  and  (12)  implies  that 


(14) 


f 


,n 


- wL^j>1. 


(15) 


If  this  procedure  is  used  with  no  further  modification,  it  is  unstable  for 
small  values  of  a (or  large  "time"  steps)  and  fixed  u as  described  next. 

Stability  Analysis 

A von  Neumann  (VN)  analysis  shows  that  the  interior  scheme  (9)  is  stable 
for  all  modes  under  the  restrictions 


0 < m < 2 


(16) 


a > 0. 


(17) 
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However,  the  boundary  scheme,  implied  by  Eqs.  (15)  and  (13)  taken  together,  is 
another  matter. 

A boundary  condition  more  general  than  Eq.  (14)  for  f can  be 
considered.  Let  a "dummy-point"  value  for  f be  given  as: 


(cxB  + 2Abx  - iaE) (G  - 1)  = -'  2^ (A  + B - iE)  (23) 
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where 


A = a (1  - cos  ?) 

B = b (1  - cos  r|) 

E = b sin  n 
5 = PAx 

n = qAy 


(24) 


o 

The  stability  condition,  |g|  < 1, 


reduces  to: 


(A  + B)[  (2  - n)b1A  + (a  - ftbj)B]  + (a  - ftb^E2}  > 0. 


(25) 


To  maintain  the  inequality  (25)  the  following  stability  restrictions  are 
easily  deduced: 

0 < ft  < 2 (26) 

a > b^  ft.  (27) 


For  the  case  y = 1 , corresponding  to  the  backward-Neumann  condition 
on  f (Eq.  (14)),  restrictions  (26)  and  (27)  reduce  to  Eq.  (16)  and 

a > b u (y  = 1).  (28) 

The  restriction  (27)  enforces  a "time"  step  limitation  on  the  scheme  for  fixed 
ft,  which  will  slow  convergence;  or  a reduction  in  ft,  according  to: 

ft  < min  (2,  \ (29) 
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which  in  fact  yields  fast  convergence  and  ensures  stability. 

It  is  noted  that  another  useful  type  of  boundary  condition  for 
given  by 


f 


j,0 


- t A 

b^  LB  ♦ 


n 

j.l 


f is 


(30) 


which  gives  the  same  form  as  Eq.  (20)  for  fj  ^ with 


_ cx(co  + 6) 
a + ^ * 


(31) 


Both  classes  of  schemes  are  implemented  by  initiating  the  bidiagonal  march  for 
f using  Eq.  (20),  under  restriction  (29). 

The  restriction  (29)  was  verified  numerically  in  both  a constant- 
coefficient,  Cartesian-coordinate  computer  code  for  Laplace's  equation  and  in 
the  "TAIR"  code  [12]  by  using  fixed  values  for  a (i.e.,  no  a-cycling)  and  0, 
and  for  various  values  of  the  coefficient  bj.  In  all  cases,  convergence  was 
obtained  when  the  restriction  (29)  was  obeyed;  and  divergence  occurred  when  it 
was  violated. 

The  experiments  with  the  TAIR  code  were  especially  interesting,  since  the 
coefficient  b^  varies  along  the  airfoil  surface.  The  test  case  chosen  was 
the  default  "0"-type  mesh  for  an  NACA  0012  airfoil.  It  was  found  that  the 
arithmetic  mean  of  b^  along  the  surface  presented  the  crucial  condition, 
rather  than  the  maximum  value,  as  might  be  expected. 

The  question  arises  as  to  why  the  TAIR  code,  which  implements  the  AF2Y 
scheme  with  the  boundary  condition  (14),  operates  so  well  since  a is  cycled 
between  small  values,  which  violate  the  restriction  (28)  and  large  values. 
The  answer  seems  to  be  that  a is  increased  within  several  rows  adjacent  to 
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the  boundary  to  a value  which  (in  the  default  mesh)  meets  the  restriction 
(28),  when  smaller  values  of  a are  used  in  the  remaining  interior  field. 
This  "fix"  was  developed  empirically  by  the  authors  of  Reference  12;  without 
this  fix  the  code  diverges.  This  procedure  is  not  recommended  in  general, 
since  it  requires  a discontinuous  change  in  a.  The  assumption  in  the 
development  of  the  factored  scheme  (9)  is  that  a is  constant  throughout  the 
mesh. 

A seemingly  attractive  scheme,  involving  a discontinuity  in  a at  the 
boundary,  is  as  follows:  Initiate  the  solution  for  f using  Eq.  (15)  with 

to  = 1 , and  change  the  second  stage  (13)  at  the  boundary  to: 


f-2bS  - a6  )A*..«f.  .-L?$.  .. 

y xx;  j>i  B j*1 


(32) 


This  procedure  exactly  annihilates  the  boundary  residual  (in  the  linear  case) 
and  represents  a fully  implicit  satisfaction  of  the  surface  boundary 
condition.  However,  the  factored  operator  at  line  k = 2 is  no  longer  the 
interior-point  operator,  since  the  term  -ab^^  in  the  inner  factor  is 

changed  to  -2b£^  discontinuously.  It  is  possible  to  analyze  such  a scheme 
by  the  methods  presented  herein,  but  the  line  k = 2 must  be  considered  as 
part  of  the  boundary  scheme*  No  details  will  be  given  here,  but  the  analysis 
shows  that  setting  to  < 4/3  at  k = 2 guarantees  linear  stability  of  the 
overall  scheme.  However,  the  amplification  factor  modulus  |g|  exceeds  unity 
only  in  a narrow  frequency  range  of  small  r\  (Eq.  (24))  when  w > 4/3. 
Numerical  experiments  showed  no  sensitivity  to  the  value  of  to  at  k ® 2. 
This  scheme  was  always  stable  in  tests  with  a constant-coefficient  Cartesian- 
mesh  code,  even  with  to  = 1*8  at  k = 2.  If  the  scheme  was  unstable  for 
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highly  stretched  grids,  setting  to  < 4/3  at  k = 2 did  not  stabilize 
the  scheme.  In  the  variable-coefficient,  nonlinear  case,  such  a scheme  is  no 
faster  than,  and  not  as  robust  as,  the  scheme  (20)  with  restriction  (29). 


Review  of  the  Stability  Theory 

It  is  well  known  that  in  general  the  von  Neumann  analysis  at  a single 
line  is  neither  sufficient  nor  necessary  for  checking  stability.  Trapp  and 
Ramshaw  [13]  pointed  out  the  usefulness  of  the  VN  analysis  to  study  boundary 
schemes  but  recognized  that  no  theoretical  justification  was  known. 

We  wish  to  review  briefly  the  stability  theory  for  finite-difference 
approximations  to  initial  boundary-value  problems.  A necessary  condition  for 
the  stability  of  such  a scheme  is  the  Ryabenkii-Godunov  condition.  It  states 
that  the  numerical  scheme  is  unstable  if  there  exists  a solution  of  the  type 


* 


n 

3 


= G 


,n 


g|  > l 


(33) 


for  the  inner  scheme  and  the  boundary  scheme.  (It  is  also  sufficient  to  check 
one  boundary  at  a time.)  Substituting  (33)  into  (12)  and  (13)  one  finds  that 
k satisfies  a constant  coefficient  second— order  difference  scheme  whose 
solution  is 


= Uk  e1(jp6x). 


(34) 


Actually  there  are  two  possible  y^s,  but  it  is  readily  verified  that  only  one 
of  them  satisfies  |y|  <1  for  |g|  > 1;  and,  therefore,  it  is  not  a valid 
solution  for  the  quarter-plane  problem. 
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In  Appendix  A we  show  that  there  exists  a solution  of  the  form  (34)  to 

(12),  (13),  and  (20)  such  that  |g|  >1  and  |p|  < 1 if  (29)  is  not 

satisfied.  This  proves  that  the  scheme  is  unstable.  By  instability  here  we 

mean  that  unbounded  solution  occurs  after  a fixed  number  of  time  steps  for  any 

mesh — it  precludes  the  possibility  of  reaching  steady  state. 

It  should  be  noted  here  that  VN  analysis  of  the  boundary  scheme  does  not 
predict  the  existence  of  solutions  of  the  form  (33)  with  | G | > 1.  In  fact, 
Gottlieb  and  Turkel  [15]  gave  an  example  of  a boundary  scheme  (Scheme  VI,  p. 
184  of  Reference  15)  coupled  with  a variant  of  MacCormack's  scheme  in  the 
interior  which  is  conditionally  stable,  yet  the  VN  analysis  of  the  boundary 
scheme  shows  unconditional  instability.  However,  Goldberg  and  Tadmor  showed 
that  for  a dissipative  interior  scheme  (i.e.,  amplification — factor  modulus 
bounded  away  from  unity  for  all  nonzero  modes)  VN  stability  of  the  boundary 
scheme  excludes  the  possibility  of  an  eigenvalue  or  a generalized  eigenvalue. 
By  an  eigenvalue  we  mean  a solution  of  the  form  (34)  with  |g|  > 1 whereas  a 
generalized  eigenvalue  is  G such  that  [ G | = 1.  Thus,  if  the  condition 

stated  in  (29)  is  satisfied  no  eigenvalue  or  generalized  eigenvalue  exists. 

In  Appendix  A we  show  it  directly.  The  theory  of  Gustafsson,  Kreiss,  and 

• • 

Sundstrom  [10]  (see  also,  Osher  [11])  states  that  for  a system  of  first-order 
hyperbolic  equations  stability  is  assured  if  there  is  no  eigenvalue  or 
generalized  eigenvalue.  While  their  theory  does  not  apply  directly  to  the 
equation 

2 

apyt  = V p, 

it  can  be  modified  to  include  this  case. 
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As  a concluding  remark  we  should  note  that  stability  here  implies 
convergence  in  the  sense  of  Lax — the  numerical  solution  converges  to  the 

analytic  one  as  the  mesh  size  tends  to  zero  for  fixed  time  t.  This  is 

clearly  only  a necessary  requirement  to  reach  steady  state. 

Two-Dimensional  Numerical  Results 

A limited  number  of  numerical  tests  for  cases  involving  stretched  grids 
and  nonlinear  transonic  flow  have  convinced  us  that  the  discontinuous-a 
schemes  (e.g.,  Eq.  (32))  are  not  as  reliable  as  the  scheme  using  Eq.  (20)  with 
restriction  (29).  Some  numerical  results  are  presented  in  Tables  1 and  2.  In 
the  tables,  the  following  identification  is  used  for  the  various  boundary 

schemes : 

Scheme  I:  Original  TAIR  scheme;  Eqs.  (13)  and  (15)  at  boundary,  with  a 

increased  at  3 lines  adjacent  to  boundary  to  satisfy  restriction 
(28)  with  10%  safety  margin. 

Scheme  II:  Exact  annihilation  of  boundary  residual;  Eqs.  (15)  and  (32), 

with  co  = 1 at  boundary  only. 

Scheme  III:  Eq.  (20)  and  restriction  (29)  with  10%  safety  margin. 

Table  1 shows  a series  of  numerical  tests  for  incompressible  flow  over  a 
circle,  with  varying  degrees  of  mesh  stretching  near  the  boundary.  The  TAIR 
code  was  used  with  co  = 1.8  at  all  points  except  as  noted  in  schemes  II  and 
III,  and  with  the  default  settings  for  the  a-cycle  (a  min  = 0.07,  a max  = 
1.5).  The  mesh  contained  101  points  uniformly  spaced  around  the  circle  and  21 
points  in  the  radial  direction  with  stretched  spacing.  The  first  column  lists 
the  cell  aspect  ratio  at  the  boundary,  Ax/Ay  (=  b^)  , for  each  case. > The  next 
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three  columns  show  the  number  of  iterations  required  to  decrease  the  starting 
residual  by  10“^  for  three  schemes  previously  discussed.  Divergence  is 
indicated  by  an  entry  "D."  It  is  seen  that  scheme  III  is  significantly  less 
sensitive  to  grid  stretching  in  the  normal  direction  than  are  the 
discontinuous-a  schemes,  I and  II. 


Table  1. 

Number  of  Iterations 

to  Reduce 

Residual  by  10  ^ 

Incompressible  Circle 

Flow,  101 

by  21  Mesh 

Ax 

Ay 

Scheme 

i 

ii 

III 

0.5 

44 

43 

34 

1 

72 

36 

51 

10 

68 

43 

47 

20 

99 

53 

48 

100 

212 

D 

34 

1000 

400 

D 

127 

As  previously  mentioned,  the  empirically— developed  default  settings  in 
the  TAIR  code  provide  for  an  increased  value  of  a near  the  surface;  the 
default  value  satisfies  the  restriction  (28)  only  for  the  first  case  in  Table 
1,  Ax/Ay  = 0.5.  For  that  case,  convergence  is  obtained;  the  scheme  diverges 
for  the  other  listed  cases  for  which  the  default  setting  violates  restriction 
(28).  In  scheme  I,  the  value  of  a near  the  surface  met  the  restriction,  and 
convergence  was  obtained  for  all  the  listed  cases. 


It  should  be  noted  again  that  the  stability  analysis  presented  herein  is 
valid  only  for  subsonic  flow,  when  the  AF2Y  scheme  is  guaranteed  to  be 
hyperbolic  in  time.  When  the  flow  becomes  locally  supersonic,  the  linearized 
Eq.  (1)  will  have  a < 0,  and  a term  which  simulates  <|>  must  be  added  for 

Xt 

stability  [16].  The  effect  of  including  such  a term  (e.g.,  in  the  second 

factor  of  Eq.  (9))  has  not  been  studied  at  present.  With  that  cautionary 

remark,  we  present  results  for  transonic  cases  in  the  next  table. 

Table  2 presents  results  for  two  transonic  cases  for  an  NACA  0012 

airfoil:  (1)  Zero  incidence  with  free-stream  Mach  number  M = 0.85  and 

00 

(2)  2°  incidence  with  = 0.75.  All  cases  were  run  with  w = 1.8,  but  with 
different  a-  cycles.  It  can  be  seen  that  there  is  little  difference  in  the 
convergence  rate  among  the  schemes,  except  that  scheme  II  is  noticeably  slower 
than  schemes  I or  III  for  case  (2). 


Table  2.  Number  of  Iterations  to  Decrease  Residual  by  10  ^ for 

Transonic  Flow.  NACA  0012,  Default  TAIR  Mesh,  149  by  30 


Flow  Condition 

Scheme 

i 

ii 

III 

M = 0.85 
00 

Zero  incidence 

190 

174 

187 

H = 0.75 
00 

2°  incidence 

190 

360 

226 
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Three-Dimensional  Version  of  AF2Y 


A three-dimensional  (3-D)  version  of  the  AF2Y  scheme  is  presented  in  Ref. 
7.  It  is  different  from  the  2-D  version  discussed  up  to  now,  in  that  the 
factors  are  reversed  in  order.  That  is,  the  scheme  can  be  written  in  the 
present  context  as: 


(“  - t 5 J(b2  - f - blV4»jU  ““L+JU  + ab2(“  - bl?y)4+J,k-l,*  (35) 


where 


L.<f> 


jk* 


fad  + 
v xx 


b6  + 

yy 


cfi  ) <(. 

ZZ' 


jk  r 


(36) 


Because  the  factors  are  reversed,  we  will  refer  to  this  scheme  as  AF2YR. 

Here  the  third  coordinate  direction  is  z,  which  can  be  thought  of  as  the 
spanwise  coordinate  for  a wing.  The  x-  and  y-coordinates  are  still  the 
streamwise  and  normal  coordinates  as  in  the  2-D  problem.  The  boundary 
operator  corresponding  to  Eq.  (8)  is: 


Vj.M  = 06xx  + 2hty  + c6zz)  +j,l,l  " 2bAys(x)*  (37> 

The  scheme  is  implemented  in  three  stages , as  follows : 


1. 

(a- 

• V - 

+ ab2f  j ,k-l  ,SL 

(38) 

2. 

(b2 

- f S:J  £jk* 

~ gj  l 

(39) 

3. 

(a  - 

• biV  - 

f jk£  * 

(40) 
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The  solution  for  f proceeds  in  planes,  outward  from  the  wing  surface,  using 
the  tridiagonal  Eqs.  (38)  and  (39).  The  third  stage  (40)  proceeds  inward, 
solving  for  the  correction  in  a bidiagonal  march. 

Again,  the  main  problem  is  how  to  initiate  the  first  stage.  In  Reference 
7,  the  boundary  condition  used  for  f is 


f j >0 , £• 


= 0. 


(41) 


We  can  again  consider  the  more  general  boundary  conditions  studied  previously, 


j > o > a 


Yf j,i,t 


(42) 


or 


_ 6 T A 

bj 


(43) 


corresponding  to  Eqs.  (18)  and  (30),  respectively.  Actually,  condition  (42) 
can  only  be  approximately  modeled  in  the  3-D  problem,  with  some  splitting 
error  in  the  first  two  factors.  That  is,  we  can  approximate  Eq.  (42)  by 
solving,  at  k = 1: 


-frs22)  -“Vj.i.t 


(44) 


[(l-r)  b2  -f  - s 


'j*' 


(45) 


Equation  (43)  is  easily  implemented  by  replacing  u in  Eq.  (38)  by  m + 8 
and  setting  f = 0. 
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Stability  Analysis  of  the  3-D  AF2YR  Scheme 

A VN  analysis  of  the  3-D  interior  scheme  shows  that  VN  stability  is 
achieved  under  restrictions  (16)  and  (17).  VN  analysis  of  the  boundary  scheme 

(42)  shows  that  sufficient  conditions  for  stability  of  the  VN  boundary  scheme 
are: 

0 < u < 1 - Y (46) 

and 

Y < 1.  (47) 

The  same  criteria  are  obtained  in  the  2-D  counterpart  of  the  AF2YR  scheme  with 
boundary  condition  (18).  The  corresponding  criteria  for  boundary  condition 

(43)  are: 

0 < to  + 8 < 1.  (48) 

At  this  time  we  have  no  numerical  experiments  to  test  the  stability  and 
convergence  of  the  3-D  boundary  conditions  (42)  or  (43)  and  the  criteria  (46) 
or  (48).  However,  some  comments  about  the  use  of  AF2YR  versus  AF2Y  are  in 
order. 

In  the  AF2YR  scheme,  the  use  of  boundary  condition  (42)  or  (43)  makes  the 
scheme  parabolic  at  the  surface;  i.e.,  the  time-like  equation  at  the  boundary 
is: 

cr<j>t  = V2<{>  (49) 

where 

a = b2  (1-y)/(o  (50) 
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for  Eq.  (42),  and  where 


(51) 


0 = b2/(u)  + g) 

for  Eq.  (43).  In  the  case  of  AF2Y,  the  boundary  equation  remains  hyperbolic, 
like  the  interior  scheme,  with 


- °<t>yt  = V2*  (52) 

where 

a = b2/n.  (53) 


It  is  felt  that  for  this  reason  AF2Y  may  lead  to  faster  convergence.  It  would 
appear  that  there  is  no  difficulty  in  implementing  such  a scheme  in  3-D,  as: 


1. 


t“  + blSy>  £jkl»  ‘ 


(54) 


2.  (ab0  - c6  ) 
^ ' zzJ 


fjk£ 


ab9Ac(> 


2 Yj ,k+l ,£ 


(55) 


3.  (1 


ab2  xx 


O ^ 


jk£  gj£* 


(56) 


The  factors  in  the  second  and  third  stages  could  also  be  interchanged.  The 
first  stage  is  initiated  by  using  Eq.  (20),  and  the  same  stability  and 
restrictions  (26)  and  (27)  hold. 
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CONCLUDING  REMARKS 


We  have  studied  the  stability  of  the  AF2Y  scheme  with  several  boundary 
conditions  for  the  intermediate  variable.  The  von  Neumann  method  provides  a 
useful  tool  for  this  study  in  view  of  the  Goldberg-Tadraor  theorem,  and  the 
results  were  verified  in  the  two-dimensional  case  by  the  more  complete  GKSO 
theory. 

In  general,  the  boundary  schemes  place  a limitation  on  a and  m which 

is  more  restrictive  than  the  requirements  for  the  interior  scheme*  Since 

small  a is  desirable  to  damp  low-frequency  errors,  one  strategy  involves 
increasing  a at  or  near  the  boundary  to  meet  the  boundary  restriction  while 

using  smaller  cx  in  the  interior  mesh.  Such  ,fdiscontinuous-a"  schemes 

require  further  analysis  of  the  stability  at  the  line  next  to  the  discon- 
tinuity since  the  scheme  there  is  no  longer  the  interior  scheme.  They  diverge 
on  certain  stretched  grids.  A safer  strategy  is  to  decrease  m at  the 
boundary  to  conform  to  the  restrictions.  This  results  in  a more  robust 

scheme;  and  it  does  not  appear  to  suffer  much,  if  any,  loss  in  convergence 

rate. 

In  regard  to  the  3-D  AF2Y  scheme,  the  current  implementation  in  the  TWING 
code  involves  a reversal  of  the  factors  from  the  2-D  TAIR  code.  We  refer  to 
this  scheme  as  AF2YR.  Although  the  reversal  of  the  factors  makes 
no  difference  in  the  interior  (for  the  linear  constant-coefficient  case), 
there  is  a significant  difference  at  the  boundary.  The  AF2YR  boundary  scheme 
is  parabolic  in  time  as  opposed  to  hyperbolic  for  AF2Y.  For  this  reason, 
there  may  be  a preference  for  the  AF2Y,  as  in  the  TAIR  code. 
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APPENDIX  A 


Application  of  the  GKSO  Theory  to  the  AF2Y  Scheme 

In  the  GKSO  theory  [10],  [11],  the  interior  and  boundary  schemes  are 
considered  as  a coupled  problem.  Instead  of  substituting  the  Fourier 

solutions  as  in  Eq.  (22),  the  class  of  trial  solutions  is  extended  to 

, n „n  i( jpAx)  k , . . . 

(J>jk  = G e Jl^  'p  (Al) 

where  p is  a complex  number  not  restricted  to  lie  on  the  unit  circle  in  the 
complex  plane.  Fourier  modes  are  retained  in  the  direction  tangential  to  the 
boundary  under  study.  The  trial  solutions  are  substituted  into  the  interior 
and  boundary  schemes,  Eqs.  (9)  and  (20),  to  obtain,  respectively: 

[a  + bx(l  - i)]  [-  ab2(y  - 1)  + 2A]  (G  - 1)  = au>[-2A  + b(p  - 2 + i)]  (A2) 


and 


[-  ab2(y  - 1)  + 2A](G  - 1)  = Cl[-2A  = 2b(p  - 1)],  (A3) 

where  Eq.  (8)  for  y *s  used  for  the  right-hand  side  of  Eq.  (A3)  and 

where  we  have  used  the  notation  of  Eq.  (24). 

Equations  (A2)  and  (A3)  are  two  simultaneous  equations  for  the  unknowns 
G and  p.  In  the  theory,  we  are  concerned  only  with  values  of  p inside 
the  unit  circle,  i.e.,  only  those  solutions  which  decay  away  from  the 
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boundary.  If  the  solution  of  Eqs.  (A2)  and  (A3)  yield  G > 1 for  y <1, 
the  scheme  is  unstable. 

If  Eq.  (A3)  is  divided  into  Eq.  (A2),  G is  eliminated;  and  there  results 
an  equation  for  y : 

n[ay  + b^(y  - 1)][-2A  + 2b  (y  - 1)]  = aw[-2yA  + b(y  - 1)^].  (A4) 

First,  it  will  be  shown  that  for 


A = a(l  - cos  O = 0 , 

there  is  a value  of  y inside  the  unit  circle.  When  A = 0,  Eq.  (A4)  reduces 
to  two  linear  factors: 

(y  “ 1){[2£2  (a  + bj)  - am]y  - 2b^  £2  + am}  = 0.  (A5) 

The  root  y = 1 is  a solution  of  Eqs.  (A2)  and  (A3)  only  when  £2  = uj, 
corresponding  to  y = !•  (See  Eq.  (21).)  Then 

G = 1 - a)  (A6) 


and  the  restriction  (16)  must  be  satisfied.  The  other  root  is: 


2b^£2  - au) 

y 2£2  (a  + bj)  - au 


(A7  ) 


which  is  less  than  1.0  and  is  arbitrarily  close  to  1.0  as  a approaches  zero. 
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Using  Eq.  (A3),  we  can  show  that  for  any  complex  y such  that  its  real 

part  is  less  than  1,  G <1  if  and  only  if  restrictions  (26)  and  (27)  are 

satisfied.  Thus,  let 

y = yR  + iy-j-  (A8) 

where  y and  yT  are  the  real  and  imaginary  parts.  Substitution  of  Eq. 
K x 

(A8)  into  (A3)  and  multiplication  by  b^  gives: 

[ab  (l  - yR)  + 2Ab^  - iabvt^](G  - 1)  = - 2Qb^[A  + b (1  - yR)  - iby^.].  (A9) 

2 

The  condition  G <1  then  yields: 

$l{ A^bj(2  - Q)  + Ab(l  - yR)[a  “ ftb^  + (2  - fl)bjJ 

+ b^(ot  - Hbj)[(l  - yR)^  + y^  ] } > 0.  (A10) 

For  y < 1,  the  restrictions  (26)  and  (27)  are  sufficient  to  ensure  the 

inequality  (A10)  for  arbitrary  positive  values  of  A,  b^ , and  b,  regardless  of 
the  magnitude  of  y When  A = 0,  a value  y <1  always  occurs,  as  shown 
by  Eq.  (A7);  and  the  scheme  will  be  unstable  unless  restriction  (27)  is 
satisfied.  Thus,  restriction  (27)  is  necessary;  and  when  it  is  satisfied  (for 
small  a),  restriction  (26)  will  also  be  satisfied. 
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Abstract 

The  time  dependent,  isentropic,  quasi-one-diraensional  equations  of  gas 
dynamics  and  other  model  equations  are  considered  under  the  constraint  of 
characteristic  boundary  conditions.  Analysis  of  the  time  evolution  shows  how 
different  initial  data  may  lead  to  different  steady  states  and  how  seemingly 
anomalous  behavior  of  the  solution  may  be  resolved.  Numerical  experimentation 
using  time  consistent  explicit  algorithms  verifies  the  conclusions  of  the 
analysis.  The  use  of  implicit  schemes  with  very  large  time  steps  leads  to 
erroneous  results . 
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INTRODUCTION 


Consider  a steady,  isentropic  flow  in  a dual-throat  nozzle  with  equal 
throat  areas,  and  assume  that  the  flow  is  choked;  then  it  is  well  known  [1] 
that  the  flow  between  the  throats  can  be  either  completely  subsonic  or 
supersonic  depending  on  the  initial  state  of  the  flow  and  the  path  taken  to 
reach  the  steady  state.  If  we  experiment  numerically  with  the  above  problem 
using  either  the  isentropic  quasi-one-diraensional  gas  dynamics  equation  or 
some  "simpler"  model  equation,  then  some  of  the  results  obtained  are  rather 
peculiar. 

(1)  If  the  initial  data  correspond  to  sufficiently  high  supersonic  flow  (or 
sufficiently  low  subsonic  flow),  then  the  steady  state  flow  obtained 
between  the  two  throats  is  indeed  completely  supersonic  (subsonic). 

(2)  If  the  initial  data  are  completely  supersonic  (or  subsonic),  but  below  a 
certain  level  (above  a certain  level),  then  the  steady  state  flow 
contains  a shock  wave  connecting  the  supersonic  branch  of  the  solution 
to  the  subsonic  branch.  For  the  model  equations  considered,  the  shock 
corresponds  to  an  isentropic  jump,  and  its  location  depends  on  the 
initial  data. 

(3)  Results  (1)  and  (2)  above  are  observed  when  time  accurate  schemes  are 
used.  However,  the  implicit  backwards  Euler  scheme  with  large  time 
steps  yields  steady  states  that  are  not  reachable  through  a time 
accurate  path  from  any  class  of  nontrivial  initial  conditions.  These 
steady  states  include  not  only  discontinuous  solutions  (as  observed  in 
[2]),  but  also  unstable  smooth  solutions. 
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(A)  The  numerical  treatment  of  boundary  conditions  is  very  important  in 


obtaining  the  proper  results.  For  example,  with  central  space 
differencing  one  may  have  a stable  algorithm  that  does  not  converge  in 
time  to  a steady  state  if  the  sonic  conditions  are  invoked  in  order  to 
supply  numerical  boundary  conditions. 

The  purpose  of  this  paper  is  to  present  our  findings,  and  to  provide,  where 
possible,  a mathematical  explanation  of  the  observed  behavior,  thereby 
removing  the  apparent  peculiarities.  We  will  show  that  the  nonuniqueness 
aspect  of  the  steady  state  solution  is  a by-product  of  the  fact  that  the 
boundary  conditions  for  the  evolution  equations  are  prescribed  along 
characteristic  curves.  This  is  true  for  the  dual  throat  problems  due  to  the 
sonic  conditions  imposed  at  the  throats.  The  model  problems  were  therefore 
chosen  to  show  this  behavior. 

In  Section  2 we  study  the  model  equation 


9u  9_ 
9t  9x 


2 

(j-)  = u(l  - u). 


The  relevance  of  this  model  equation  to  the  quasi— one-dimensional  gas 
dynamical  equations  is  somewhat  peripheral.  However,  it  is  rich  in  the  number 
of  possible  steady  solutions  that  it  admits,  including  unstable  continuous  and 
discontinuous  solutions.  In  this  section  we  discuss  the  proper  way  to 
formulate  the  characteristic  boundary  conditions  for  first  order  quasi-linear 
hyperbolic  equations. 
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In  Section  3 we  consider  the  model  equation 


au  3_ 
3 1 3x 


2 

(•g— ) = sin  x cos  x. 


This  model  equation  has  solutions  which  qualitatively  behave  like  those  of  the 
isentropic  dual  throat  nozzle  problem.  The  simplicity  of  the  model,  however, 
affords  a detailed  study  of  the  possibilities  for  anomalous  behavior.  This 
model  equation  will  also  show  us  how  to  quantify  such  vague  terms  as 
sufficiently  high  (or  low)  supersonic  (subsonic)  initial  conditions  that  were 
mentioned  in  (1)  and  (2)  above.  These  results  are  summarized  in  Theorems  1 
and  2. 

In  Section  4 a model  scalar  equation  is  developed  which  has  all  of  the 
interesting  physical  aspects  of  the  complete  isentropic  quasi-one-dimens ional 
gas  dynamic  equations  governing  the  dual  throat  nozzle  problem.  To  develop 
this  equation,  our  guideline  was  to  retain  the  differential  equation 
exhibiting  the  characteristic  boundary  condition  and  to  model  the  other 
dependent  variable  by  assuming  constant  total  enthalpy  during  the  time 
evolution.  By  comparing  the  theoretical  results  of  the  model  equation  to 
numerical  calculations  for  the  complete  system  of  equations,  this  section 
shows  that  the  proposed  single  equation  is  indeed  a good  model  of  the  complete 
system.  Here,  by  the  "goodness"  of  the  model  we  mean  that  all  of  the 
important  features  of  the  system  are  retained. 

Recently  Kreiss  and  Kreiss  [4]  have  investigated  the  above  model 
equations  in  the  presence  of  a linear  dissipative  term  of  the  form  GUXX* 
They  show  that  in  this  case  the  solution  is  unique  and  discuss  the  convergence 
properties  of  their  numerical  scheme. 
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2.  FIRST  EXAMPLE 


Here  we  consider  the  scalar  hyperbolic  partial  differential  equation 


9 u 8 

9 1 9x 


2 

(j“)  = u(l  ~ u), 


O^x^l,  t > 0, 


u(x,0)  = g(x). 


(2.1) 


For  reasons  mentioned  in  the  introduction,  and  to  be  discussed  in  detail  in 
Section  4,  we  are  interested  in  cases  that  model  physical  situations  in  which 
the  boundaries  are  characteristic.  In  practice,  when  (2.1)  is  solved 
numerically  as  a characteristic  boundary  value  problem,  the  boundary 
conditions  are  imposed  dynamically  as  follows: 


u(0,t) 


u(l,t) 


( 0 if  u(eQ,t)  >0  (eq  = Ax) 

(unspecified  if  u(eQ,t)  _<  0 
0 if  u(e1,t)  < 0 (e^  = 1 - Ax) 

w unspecified  if  u(e^,t)  0 


(2.2a) 


(2.2b) 


There  are  two  families  of  continuous  steady  states  satisfying  (2.1)  and  the 
analytical  versions  of  (2.2): 


u = 0 


(2.3) 


u = 1 - en“x  (0  < n < 1). 


(2.4) 
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The  stability  theory  of  ordinary  differential  equations  applied  to  the 
characteristic  equation  du/dt  = u(l  - u)  easily  shows  that  the  steady  state 
solution  u = 0 is  unstable. 

There  are  also  weak  solutions  connecting  various  branches  (different  p's) 
of  (2.4).  These  discontinuous  solutions  are  unstable  as  will  be  demonstrated 


now.  Let 


(2.5) 


be  a steady  state  corresponding  to  n = 


(2.6) 


be  another  branch. 


Since  we  want  to  rule  out  "expansion  shocks,"  i.e.,  discontinuities  that  do 
not  obey  the  "entropy  condition"  u^  > 0 > u^,  we  will  consider  only  the  case 
of  1 > h j 0,  although  the  analysis  is  unchanged  if  < Pj  • For  a 

+ u^(xg)  = 0.  This  determines  the  shock 

location,  xg,  to  be 


steady  state  shock  we  require 


UL(XS) 


x 


S 


(2.7) 


We  now  ask,  what  will  be  the  shock  speed,  Xg  = 
perturbed  to  Xg  + e ? Upon  substituting  the  perturbed  shock  position  in 
(2.5)  and  (2.6),  we  get  for  the  new  shock  speed 


2 (uL  + V*  if  XS 


is 
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(2.8) 


UL  + Ur  _ i -e  nr  2. 

^ = 1 - e « e + 0(e  ). 

Thus,  if  e > 0 (e  < 0)  the  shock  will  move  to  the  right  (left),  showing  that 
the  solution  with  a shock  is  not  stable. 

We  have  thus  shown  that  in  the  steady  state  we  need  consider  only  the 
smooth  solutions  in  (2.4).  We  will  now  demonstrate  that  these  solutions  are 
reachable  from  initial  data.  The  demonstration  is  first  done  for  the  case 
n = 0,  g(x)  > 0 for  all  x > 0,  and  g(0)  = 0. 

Consider  the  problem  (2.1),  and  let 


g(x)  = b(l  - e x),  b > 0. 


(2.9) 


The  solution  to  this  problem  is  readily  verified  to  be 


l - -x 

u(x,  t)  = b - — . 

e + b(l  - e ) 


(2.10) 


Clearly,  as  t ~,  u(x,t)  1 - e x,  which  is  a proper  steady  state. 

Suppose  now  g(x)  is  not  a multiple  of  the  steady  state  but  is  a general 
initial  condition  still  satisfying  g(0)  = 0,  g(x)  > 0.  The  characteristic 
equations  are 


dx  _ 
cTt  “ 


u 


(2.11) 


#-ud  -u). 


(2.12) 


From  (2.12)  one  gets 


K(g) 

g(?)  + (l  ~ g(?))e 


(2.13) 
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where  % - ?(x,t)  is  the  origin  of  the  characteristic  passing  through  x 
and  t.  By  inserting  (2.13)  in  (2.11)  and  integrating  again  along  the 
characteristic,  we  get  the  following  implicit  relation  between  £,  x and  t: 

gX-^-t  = [g(5)  + (i  _ g(5))e"t]  (2.14) 

or,  upon  rearranging 

x-£  . 

g(5)=^r (2.15) 

e - 1 

The  argument  is  now  as  follows:  x - £ is  finite  (0  x-£  < 1),  and  thus  as 

t •>  “,  g(5)  + 0,  but  g(£)  -*•  0 only  for  £ + 0.  Hence,  for  any  finite  x, 

as  t increases,  g(5)  takes  the  large  time  asymptotic  form  of 

g(5)  ~ 6 ■ -■  1 (t  » 1).  (2.16) 

e - 1 

Substituting  (2.16)  in  (2.13)  we  get 

u(x,t)  ~ — 6 (t  » 1).  (2.17) 

1 - e 

Thus,  as  t + <*>,  u(x,t)  •>  1 - e x regardless  of  the  detailed  form  of  the 
initial  data. 

For  other  types  of  initial  data  (e.g.,  g(x)  = 0 for  some  x = xq),  the 
proof  is  the  same  with  n = Xq  and  the  coordinate  x transformed  to 


If  g(x)  has  several  simple  zeros,  then  the  interval  0 x 1 is  sub- 
divided by  the  zeros.  Their  relative  locations  will  determine  the  proper  n. 
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In  particular,  if  g(x)  is  a periodic  function,  g(Xj)  = 0,  with 
xj  = ^ » j = 0,1..., N,  then 


if 


or 


i 


(i)  sgn  g'(0)  = sgn  g'(l)  > 0 
(ii)  sgn  g'(0)  = - sgn  g'(l)  < 0 


(2.18a) 


and 

= Vi 

if 

(i)  sgn  g'(0)  = sgn  g'(l)  < 0 
or 

(ii)  sgn  g'(0)  = - sgn  g'(l)  > 0 


(2.18b) 


(where  primes  denote  differentiation  with  respect  to  the  independent 
variable).  In  summary,  this  example  demonstrates  the  richness  of  possible 
steady  state  solutions. 

(1)  There  is  an  unstable  smooth  solution,  u = 0. 

(2)  There  are  unstable  discontinuous  solutions. 

(3)  There  is  a one-parameter  family  of  smooth  steady  states, 

u = 1 - eT1'X 

with  the  value  of  the  parameter  depending  only  on  the  initial  data,  a 
direct  consequence  of  the  problem  having  characteristic  boundary  values. 
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It  is  interesting  to  note  that  if  the  right-hand  side  of  equation  (2.1) 
is  taken  to  be  u(u-l),  instead  of  u(l-u),  then  there  is  only  one  possible 
stable  steady  state  solution  satisfying  the  boundary  conditions  (2.2),  namely 

u = 0. 

Note  that  this  was  one  of  the  unstable  solutions  of  the  previous  case. 


2.1  NUMERICAL  RESULTS  FOR  THE  FIRST  EXAMPLE 

2.1.1  Explicit  Form 

The  conservative,  upwind,  first  order  scheme  of  Engquist-Osher  (E-0), 
[3]  is  used  to  approximate  the  hyperbolic  system  of  conservation  laws 
represented  by 

If ♦ h (r)  - »<*■“>  (2a9) 


where  h is  a source  term.  Let  u"  represent  the  discrete  value  of  u at 
tn  = nAt  and  x.^  = iAx.  The  explicit  E-0  scheme  for  equation  (2.19)  is, 

I (1  - «i+1)(u"+1)2  + Vu")2  - j (1  + 6i-l)(ui-l)2 


n+1  _ n 1 At 
ui  ui  " 2 Ax 


+ h(iAx,u”)At 


(2.20) 


where  the  switch  function  6^  is  defined  by 
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0 


6 


i 


n 


u. 


= 0 


n 


* 0. 


As  usual,  At 


satisfies  the  Courant— Friedrichs— Lewy  condition, 


(2.21) 


At  < — X n , (2.22) 

max | u^ | 

and  Ax  = L/100,  where  L is  the  length  of  the  interval  of  interest.  For  the 
explicit  E-0  scheme  convergence  was  established  according  to  the  criterion 


max|u"+1  - uj|  < 1.  x 1(T3.  (2.23) 

i 1 

The  relation  given  by  (2.23)  is  equivalent  to  requiring  the  steady  state 
operator  of  (2.20)  to  be  less  than  10  Figure  1 compares  the  exact  and 
computed  steady  states  for  equation  (2.1)  with  initial  conditions* 


g(x)  = - sin  2irx.  (2.24) 

Note  that  the  steady  state  satisfies  the  condition  (2.18bi)  and  that  the 
initial  conditions  and  steady  state  solution  are  such  that  no  boundary 


Note  that,  because  of  the  first  order  accuracy  of  the  Engquist-Osher 
scheme,  Figure  1 shows  a slight  discrepancy  between  the  analytic  and  numerical 
solution.  The  same  problem  run  with  x = 1/1000  gives  results  that,  on  the 
scale  of  Figure  1,  are  indistinguishable  from  the  analytic  results.  This 
comment  holds  for  all  other  numerical  experiments,  where,  in  order  to  save 
computer  time,  we  used  100  mesh  points. 


66 


conditions  are  imposed  at  either  end  of  the  interval.  The  same  steady  state 
is  also  obtained  with 

g(x)  = -x(x-l)(x  - J ).  (2.25) 

Figure  2 compares  the  exact  and  computed  steady  states  for  initial  conditions 

g(x)  = sin  2ttx.  (2.26) 

The  steady  result  is  in  agreement  with  the  condition  (2.18ai). 


2.1.2  Implicit  Form 

The  slow  convergence  to  steady  state  characteristic  of  explicit  schemes 
has  stimulated  research  into  various  acceleration  techniques.  One  of  the  most 
promising  avenues  for  acceleration  consists  of  recasting  the  discrete  equation 
in  implicit  form.  If  we  define  the  increment  in  time  of  u by 

Aui  = uf1  - uj,  (2.27) 


then  the  E-0  scheme  in  implicit  form  is 


i (i  - s1+iK+i  4ui+i + (tf + 6i u"  ~ (Hr)"  “)4ui ' i + ViK-i  “"i-j 


1 

J 


J (!  - 6i+lXui+l)2  + 6i(ui)2  “ J t1  + 6i-l)(ui-l)2  + h(iAx,u")Ax, 


(2.28) 
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where  6^  is  defined  as  before  by  equation  (2.21).  To  obtain  equation 

(2.28) ,  terms  of  order  and  higher  are  neglected.  It  is  easy  to  see,  by 

comparing  equations  (2.20)  and  (2.28),  that  the  right-hand  side  of  equation 

(2.28)  is  the  steady  state  operator.  For  the  implicit  E-0  scheme  convergence 
was  established  by  requiring  that  the  steady  state  operator  be  less  than  10"^ 
at  all  mesh  points. 

Figure  3 shows  the  steady  state  solution  obtained  using  the  implicit 
E-0  scheme  with 

g(x)  = sin  2ttx  (2.29) 

and  using  infinite  Courant  number  [—  = 0).  The  steady  state  obtained  with 
the  implicit  form  of  the  scheme  corresponds  to  one  of  the  unstable  solutions 
of  equation  (2.1).  The  stable  solution,  for  g(x)  corresponding  to  equation 

(2.29) ,  was  shown  in  Figure  2.  The  peculiar  behavior  of  the  implicit 

algorithm  at  large  Courant  numbers  is  further  demonstrated  in  Figure  4 for 

g(x)  = - x(x  - l)(x  - j)  (2.30) 

and  infinite  Courant  number.  For  this  case,  the  steady  state  reached  by 

(2.28)  consists  of  a combination  of  stable  and  unstable  steady,  piecewise 
solutions  of  equation  (2.1). 
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O Computed 
Exact,  unstable 


Figure  3.  Exact  and  computed  unstable  steady  states  for  equation  (2.1) 
with  initial  conditions  (2.29)  using  an  implicit  scheme  with 
large  Courant  number. 
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Figure  4.  Computed  steady  state  for  equation  (2.1)  with  initial 
conditions  (2.30)  using  an  implicit  scheme  with  large 
Courant  number. 
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3.  SECOND  EXAMPLE 


We  now  shift  our  attention  to  another  advection  problem.  The  steady 
states  of  this  problem  are  of  a completely  different  nature  than  of  those 
found  in  the  previous  example. 

The  partial  differential  equation  under  consideration  is 

2 

^ (•j— ) = sin  x cos  x,  O^x^tr,  t>0  (3.1) 


u(x,0)  = g(x),  g(0)  = g(  tt)  = 0 


with  boundary  conditions  as  given  by  (2.2). 

Here  we  have  two  smooth  steady  state  solutions, 

u+  = sin  x 
u = - sin  x. 


(3.2a) 


There  is  also'  an  infinite  number  of  possible  discontinuous  solutions  of  the 
form 

+ v 

U = U X < Xg 

, (3.2b) 

u = u x > Xg 

where  xs,  the  "shock"  location,  is  an  arbitrary  point  in  the  interval 
(0,ir).  Note  that,  in  the  steady  state,  the  "shock"  speed  Ug  = (u+  + u )/2 
is  zero  for  any  0 < Xg  < it  and,  therefore,  (3.2b)  is  a legitimate  steady 

state  solution.  In  the  above  solutions  we  have  already  eliminated  weak 

solutions  that  violate  the  "entropy  condition,"  u^  > 0 > u^. 
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We  now  ask  two  questions: 


(i)  From  what  class  of  initial  conditions,  if  any,  can  either  of  the  two 
smooth  solutions,  (3.2a),  be  reached  and 
(ii)  Under  what  circumstances  is  a steady  shock  established,  and  can  its 
location  be  predicted? 

Consider  first  the  two  questions  in  the  particularly  simple  case  when 

g(x)  = 0 sin  x,  (3.3) 

i.e.,  the  initial  data  are  proportional  to  a smooth  steady  state.  For 

S > 1,  Theorem  1 shows  that  the  steady  state  is  the  smooth  solution  u = 
u+.  For 

0 < -1,  a corollary  of  Theorem  1 leads  to  u = u“. 

Theorem  1 : The  solution  of  equation  (3.1)  with  boundary  conditions 

(2.2),  initial  conditions  (3.3)  and  0 > 1 satisfies 

lim  u(x , t ) = sin  x. 

t+oo 


Proof  • The  characteristic  equations  resulting  from  (3.1)  are 


dx 

dt  " 


u 


(3.4) 


du  _ du  _ clF 
dt  U dx  dx  * 


F = j sin2  x. 


(3.5) 
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Again  using  £ = g(x,t)  to  designate  the  origin  of  a characteristic  curve 
passing  through  (x,t),  we  integrate  (3.5) 

j u2  - ± g2(0  = F(x)  - F(?) 
or 

u = ±[2F(x)  - 2F(5)  + g2(5)]1/2  • (3.6) 

As  t 0,  ? -*■  x and  we  have  to  choose  the  positive  branch  of  (3.6)  because 

9 

3 > 1.  Thus,  using  F = (1/2)  sin  x, 

u = [sin2  x + (&2  - l)sin2  g]*^2.  (3.7) 


We  claim  now  that  for  t large  enough  there  is  a unique  correspondence 
between  a point  (x,t)  and  £(x,t).  In  fact,  if  a shock  wave  were  to  appear 
at  a certain  time  t > 0,  it  will,  because  of  (3.7),  separate  two  positive 
states.  The  shock  wave  will  have  a positive  speed  and  consequently  will 
propagate  out  of  the  domain.  Therefore,  for  t large  enough,  we  may 
substitute  (3.7)  into  (3.4), 


t = r 

5 [ 2F(y)  - 2F(5)  + g2(g)]1/2 


£ [sin2  y + (g2  - l)sin2  g]1^2 


(3.8) 

(3.9) 


For  every  x < n , the  integrand  in  (3.9)  cannot  become  singular  except  at  the 
lower  limit  y = £ > ? + 0*  Thus,  t “ as  £ + 0 and  the  only  possible 
solution  for  very  large  time  is,  from  (3.7), 
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u ££  [ 2F(x)  - 2F(g)  + g2(?)]1/2  = [2F(x)  - 2F(0)  + g2(0)]1/2  = sin  x, 

5*0 

which  completes  the  proof. 

Corollary:  Suppose  that  $ in  (3.3)  satisfies  & < -1 , then 

lim  u(x,t)  = - sin  x. 


Note  that  in  view  of  (3.8)  the  results  of  Theorem  1 hold  for  any  initial 
conditions  g(x)  such  that  g(0)  = 0,  g(x)  > sin  x.  The  corollary  is  thus 
also  valid  for  any  g(x)  < - sin  x. 

Still  continuing  with  the  case  of  g(x)  = 6 sin  x,  we  now  consider 

0 < B < 1.  (3.10) 

Here  the  steady  state  will  be  of  the  form  (3.2b).  We  will  show,  however,  in 
Theorem  2 that  the  shock  location  depends  on  the  initial  condition. 


Theorem  2 : The  solution  of  equation  (3.1)  with  boundary  conditions 

(2.2),  initial  conditions  (3. 3), and  0 < B < 1 satisfies 

u+  = sin  x,  0 < x < x_ 

(3.11) 

u = -sin  x,  Xg  < x < it 


lim  u(x,t)  = •; 

t+“  / 
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where 

Xg  = it  - sin  * / 1 - g2  > -j  . (3.12) 

Proof : From  the  characteristic  equation  (3.5),  with  0 < g < 1,  we  get 

u(x,t)  = ±[sin2  x - (1  - g2)sin2(^(x,t))]1//2.  (3.13) 

In  the  interval  (it  - xg , xg),  Xg  as  defined  in  (3.12),  u(x,t)  cannot  change 
sign  because  the  radical  in  (3.13)  cannot  vanish  in  said  interval.  Since  as 
t -*•  0,  u(x,t)  is  positive,  we  conclude  that 

u(x,t)  = [sin2  X - (1  - g2)sin2(5(x,t))]^2,  Tr~xg  < x < xg.  (3.14) 

In  this  interval  the  first  characteristic  equation  (3.4)  becomes 

t'/X  2 ~2 2 172  0.15) 

K [sin  y - (1  - 8 )sin  (f[(x,t))] 

since  t > 0 we  must  have  5 < x when  Tr-xg  < x < Xg.  As  t + «>,  £(x,t) 
must  therefore  vanish  in  the  limit.  It  is  thus  established  that 

lim  u(x,t)  = sin  x,  (ir-Xg  < x < xg).  (3.16) 

t+°° 

Next  consider  the  interval  [0,tt-x  ).  Formally  as  t in  this  leftmost 

D 

interval,  5(x,t)  must  converge  either  to  zero  or  tt.  However,  any 
characteristic  passing  through  (x,t)  in  the  interval  [0,Tr~xg)  cannot 

emanate  from  any  E,  > xg  because  this  would  mean  a negative  slope,  and  hence 
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a negative  u in  the  interval  (tt-x^,  xg);  this  contradicts  (3.16).  Having 

established  that  lim  £(x,t)  = 0,  we  notice  that  formally  it  is  possible  for 

t+«> 

a characteristic  curve,  originating  in  the  interval  [0,tt-x  ),  to  start  with  a 

O 

positive  slope  (required  as  t 0)  and  change  slope  in  the  interval.  This, 
however,  will  result  in  a solution  containing  a "shock"  that  violates  the 
"entropy  condition"  u^  > 0 > u^.  We  thus  have  our  next  intermediate  result 

lim  u(x,t)  = sin  x,  (0  £ x < xc).  (3.17) 

t+»  b 

It  now  remains  for  us  to  show  that  in  the  interval  xg  < x < tt  the  solution 
must  be  negative  and  hence  equal  to  - sin  x. 

We  first  integrate  (3.1)  to  get 

a u tt  2 

TT  J udx  = “ / (y-)  xdx. 

3t  0 0 2 

Suppose  that  at  the  point  0 < Xj  < x2  • • • < xn  < n,  u(x,t)  is  discontinuous, 
since  we  admit  only  "shock"  discontinuity  u (x^)  > u (x^).  Thus, 

ft  fj  u(x,t)dx  = J u2(0,t)  - | (u2  (*+)  - u2(x"))  - u 2 ( tt , t ) (3.18) 

from  (3.13),  u^(0,t)  = 0 and  therefore, 

IT  IT 

J u(x,t)dx  < J u(x,0)dx  = 2$.  (3.19) 

0 0 

Let  xq  be  the  point  in  which  u(x,«)  changes  sign.  From  (3.16),  we  have 
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and  from  (3.19)  we  have 


x 


s 


< X 

a 


x 

a tt 

J sin  x - J sin  x < 23,  (3.20) 

0 x 

a 


thus. 


-2  cos  < 2g 
a 


or 


x < 
a 


x 


s 


(3.21) 


and  therefore 


x = x . 
a s 


(3.22) 


This  completes  the  proof. 

It  should  be  noted  that,  in  general,  xg  gives  a lower  bound  on  the  location 
of  the  discontinuity  whereas  the  area  rule  (3.19)  yields  an  upper  bound  on  it. 

Corollary:  Under  the  conditions  of  Theorem  2 with 


-1  < 3 < 0 

the  solution  still  retains  the  form  of  (3.11)  except  that  now 

xg  = sin"1  / 1 - 32  < j • 
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For  arbitrary  initial  data  the  general  behavior  is  that  described  in 
Theorems  1 and  2 and  their  corollaries,  i.e.,  one  can  get  either  solution 
(3.2a)  or  (3.2b).  If  a "shock"  is  present  in  the  steady  state,  the  upper  and 
lower  bounds  for  its  location  are  given,  for  g(x)  > 0,  as  follows: 


7T  - sin 


-1 


/ 


sin 


z - 


g2(z) 


XS^ 


tt  - sin 


-1 


/~-T 


(/  g(n)dn)^ 

0 


(3.23) 


where  z maximizes  the  expression  sin2  x - g2(x).  For  negative  initial  data 
the  bounds  are 


sin  1 / sin2  z - g2(z)  < x < sin"1  / 1 -•£•(/  g(n)dn)2  . (3.24) 

0 

The  upper  bound  reflects  the  "area  rule"  (see  (3.18)).  The  lower  bound  is  the 
first  point  where  u(x,t)  can  change  sign.  For  g(x)  > 0,  the  upper  bound 
becomes  sharp  (i.e.,  equals  Xg),  if  u(ir,t)  = 0 for  all  t. 

3.1  NUMERICAL  RESULTS  FOR  THE  SECOND  EXAMPLE 

3.1.1  Explicit  Form 

Equation  (3.1)  is  discretized  using  the  explicit  E-0  scheme  given  by 
equation  (2.20).  Numerical  calculations  were  performed  for  initial  conditions 
given  by 

g(x)  = 8 sin  x,  (3.25) 
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where  g is  a free  parameter  such  that  0 _<  g < 1.  The  steady  state  shock 
position  as  a function  of  g is  plotted  in  Figure  5.  The  numerical  results 
are  in  excellent  agreement  with  the  theoretical  prediction  given  by  equation 
(3.12).  For  any  g > 1,  the  steady  state  obtained  was  u+  given  by  equation 
(3.2a). 

If  one  uses  an  algorithm  employing  central  space  differencing  (e.g., 
MacCormack's  scheme),  it  is  then  necessary  to  supply  a numerical  boundary 
condition.  If  the  steady  state  value  is  used  for  the  boundary  condition,  then 
the  numerical  algorithm,  though  stable,  fails  to  converge  to  steady  state. 
The  reason  is  clearly  due  to  the  fact  that  the  numerical  boundary  condition 
does  not  allow  for  a flux  through  that  boundary.  As  a consequence  we  have 
(see  (3.19)) 

ir 

J u(x,t)dx  = 2g 
0 

for  all  t,  while  the  true  steady  state,  u+,  requires 

7T 

lim  J u(x,t)  = 2. 
t-*-»  0 


3.1.2  Implicit  Form 

Equation  (3.1)  is  discretized  using  the  implicit  E-0  scheme  given  by 
equation  (2.28).  Once  again,  numerical  calculations  were  performed  for 
initial  conditions  given  by  equation  (3.25).  Now  an  additional  free  parameter 
is 


100  Ax 
it  At 


(3.26) 
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which  is  a measure  of  how  big  At  is  taken  in  the  numerical  calculations. 
The  results  of  these  series  of  calculations  are  given  in  Figure  6.  As 
indicated  in  the  figure,  if  "small"  At's  are  taken  (e  >}/i  ),  then  the 
steady  state  shock  location  calculated  agrees  with  the  theoretical  prediction 
of  equation  (3.12).  However,  as  At  increases,  the  steady  state  shock 
position  is  found  to  the  right  of  its  theoretical  location.  For  sufficiently 
high  values  of  At  (small  e's),  the  smooth  solution  is  obtained. 


82 


Figure  6.  Computed  and  predicted  steady  state  shock  position  for 
equation  (3.1)  with  initial  conditions  (3.25)  using  an 
implicit  scheme. 


4.  A MODEL  FOR  QUAS I-ONE-D IMEN S IONAL  FLUID  DYNAMICS 

A characteristic  boundary  value  problem,  where  boundary  conditions  are 
of  the  form  (2.2),  occurs  in  a double-throat  Laval  nozzle 


as  shown  in  Figure  7.  It  is  well  known  [1]  that  there  are  two  possible  smooth 

steady  solutions,  with  sonic  conditions  at  the  throats.  Between  the  throats, 

0 < x < 1,  the  flow  can  be  either  completely  subsonic  or  supersonic,  the  exact 

Mach  number  distribution,  in  each  case,  being  dependent  on  the  nozzle  area, 

A(x) , where  1 < A(x)  < A in  (0,1),  A(0)  = A(l)  =1. 

max 

If  one  considers  the  isentropic  case  only,  then  the  flow  may  be 
described  by  the  quasi-one-diraensional  partial  differential  equations  for  the 
Rieraann  variables. 


♦ ° U + r C, 

Y y - 1 


(j)  = u - r c, 

Y y ~ 1 


1 / 2 

where  u is  the  velocity,  c = (yp/p)  ' is  the  speed  of  sound,  and  y is 
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the  ratio  of  specific  heats  for  ideal  gases.  The  equations  are 


|^jr  + (u  + c)  |^-  + ucF'(x)  = 0,  (4.1) 

+ (u  - c)  - ucF'(x)  = 0,  (4.2) 


where  F'(x)  = dF(x)/dx  = d(£nA(x)) /dx.  This  is  a hyperbolic  system  whose 
time  evolution  is  difficult  to  describe  analytically.  We  therefore  seek  a 
model  for  this  system  so  that  with  a single  equation  the  most  salient 
features  are  retained.  We  will  present  numerical  evidence  that  analytical 
predictions  resulting  from  this  model  equation  agree  very  well  with  results 
found  by  numerical  integration  of  the  original  system  (4.1),  (4.2). 

The  model  is  derived  using  a single  assumption,  namely  that  the  total 
enthalpy  is  constant  not  only  at  steady  state  but  also  during  the  transient 
phase.  The  mathematical  expression  of  this  assumption  is  that 


,2  22(1  -a)  x , _ 

(f)  + + <f)tj)  — 


4cj 


16 


2a  - 1 


Y2  - 1 


(4.3) 


where 


= I 


+ 1 


(4.4) 


Cq  is  the  stagnation  sound  speed,  and  c*  is  the  sonic  sound  speed,  i.e., 
c*  is  the  sound  speed  at  a sonic  throat. 

We  now  face  the  choice  of  solving  (4.3)  for  either  \[>  in  terms  of  (J>, 
or  vice  versa.  This  dilemma  is  resolved  by  recognizing  that  our  "physical" 
problem  will  impose  characteristic  boundary  conditions  on  (4.2),  and  we  would 
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like  our  model  equation  to  retain  this  feature.  Therefore,  (4.2)  is  the 
relevant  equation.  Solving  for  ip  gives 


. - 1 - a 1 

ib  + — 

r a a 


, 2 2 
4a  c* 


2a 


— - (2a  - l)cf> 


1/2 


(4.5) 


where  the  positive  branch  was  chosen  in  order  to  satisfy  the  steady  state 
boundary  condition  at  x = 0,  i.e.,  at  the  first  throat,  where 


2a 

2a  - 1 C*; 


4>*  = " 


2(1  - a) 
. 2a  - 1 


(4.6) 


Using  (4.5)  in  (4.2),  and  defining 


<f>  “ <(>/,l'* 


(4.7) 


the  equation  (4.2)  takes  the  form 

|i+  A(i)  f|  = H(J)F'(X) 


(4.8) 


where 


(4.9) 


(4.10) 


T 


tc 


*• 


(4.11) 


Notice  that  the  time  scale,  t,  is  determined  by  the  sonic  conditions. 
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For.  the  sake  of  clarity  let  us  first  examine  the  simple  case  of  a = 1 
(y  = 3),  which  corresponds  to  the  flow  of  products  caused  by  detonating  solid 


explosives.  Equation  (4.8)  then  becomes 


f£  + ♦ ff  = I (1  ” 2J2)F'(x),  F(x)  = £nA(x).  (4.12) 


A smooth  steady  state  solution  of  (4.12)  with  <f>(0)  =0  is 


l2f  \ 1 “F(xK 

«(.  (x)  = y (1  - e 


(4.13) 


since  A(0)  - 1,  and  so,  as  in  (3.2a)  we  have  two  possible  steady  states.  One 
is  positive  (supersonic)  and  the  other  is  negative  (subsonic): 


:+  _ r A(x)  - 1^1/2 
+ " ^ 2A(x)  > 


(4.14) 


_ _ ( A(x)  - 1^1/2 
" V ”A(x)  J * 


(4.15) 


Bearing  in  mind  the  results  of  the  previous  sections,  we  will  show  that  in  the 

A_ 

time  evolution  problem,  § and  <|)  are  reachable  from  different  initial 
conditions.  Clearly  (4,14)  and  (4.15)  can  be  connected  by  a steady  shock  - 
and  again,  because  of  the  symmetry  of  ^ and  <j>  , the  steady  shock  location 
xs  could  be  anywhere  in  the  interval  (0,1).  We  will  show  that  here  too 

bounds  on  Xg  can  be  found  and  compare  them  with  results  of  numerical 

integration  of  the  original  system  (4.1),  (4.2). 

We  will  concentrate  on  the  positive  branch  (4.14),  showing  that  if  the 
initial  condition  is  given  by 
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with 


1 < 3 


2 


< 


A 

max 


A - 1 ’ 
max 


(4.17) 


where  Amax  is  the  maximum  area  in  the  nozzle,  then 
solution  of  the  second  characteristic  equation, 


lim  <|>(x,t) 


/(x).  A 


'^-<I,'<Ix-t(1“  2*V<x>  (4.18) 


is  given  by 

|l  - 2<f>2 1 = |l  - 2g2(£(x,x))|A(£(x,T))/A(x),  (4.19) 


where  as  before  g(x,x)  is  the  origin  of  a characteristic  curve  passing 
through  (x,t).  Since  we  have  chosen  (see  (4.17))  g2(x)  to  be  smaller 

than  1/2,  it  follows  from  (4.19)  that 


4>(x,{;)  = ± 


A(x)  ••  A(5)Tl  “ 2g  (£(x,x))l 
2A(x) 


1 1/2 


(4.20) 


where  £(x,x)  is  to  be  determined  from  the  first  characterisitc  equation 


x 


? <Ky>5) 


(4.21) 


From  (4.16)  we  see  that  a positive  (negative)  S will  initially  select  a 
positive  (negative)  branch  of  (4.20).  By  an  argument  similar  to  that  used  in 

A 

Theorem  1,  it  remains  for  us  to  show  that  <\>  thus  initiated  will  not  change 


sign  while  evolving  to  steady  state.  This  follows  immediately  from  (4.20)  if 
we  use  for  g(x)  equation  (4.16)  with  g > 1. 

Next  we  consider  the  discontinuous  steady  state  solution.  The  initial 
data  are  now  taken  so  that  |g(x)|  < , see  equation  (4.14).  A lower  bound 
for  xs  is  found  by  inquiring  about  the  zeros  of  (4.20)  - the  argument  is  the 
same  as  in  the  previous  section.  The  radical  in  (4.2)  is  zero 


A(Xg)  = A(z)(l  - 2g2(z) ) 


(4.22) 


where,  as  before,  z maximizes  the  expression  A(x)(l  - 2g^(x)).  To  find  the 

upper  bound  we  have  to  devise  an  "area  rule"  for  equation  (4.12).  Because  of 

_ 1. 
the  structure  of  the  right-hand  side  of  (4.12),  it  is  no  longer  / <|)(x,  T)dx 

0 

which  is  conserved.  To  find  the  appropriate  "area  rule,"  we  divide  both  sides 

+>  2 

of  (4.12)  by  1 2(f>  >0.  The  resulting  equation  after  integration  by  x 

over  the  interval  may  be  written  as 


L_  r\n  L±Y?.  i ' 1 

*T  0 - ~ jj  n 3x 

0 1-/24,  n 0 


dx  “ / |^-[in(l  - 2<j>2)  ]dx  = — F(x) 

/2 


= 0. 


(4.23) 


A 

Under  the  usual  area  rule  assumptions,  <J> (0 , r ) = 4>(l,x)  = 0,  we  have 


1 


1 + /T  $ 

*•  dx  = const. 


/ in 

0 1-/7,}, 


(4.24) 


Therefore,  an  upper  bound  for  xg  is  found  from 


x 


/ S tn  *1  dK  + J1  t„  k+J2.  r dx  - / ‘to  1 * 22 

n 1 /TT  . * - — / 


1-/24, 


1-/24, 


0 1-/2  g(x) 


dx.  (4.25) 
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When  g(x)  _<  g<)>  , (g  < 1)  we  expect,  as  in  the  previous  example,  the  upper 
and  lower  bounds  on  Xg  to  coincide.  This  was  indeed  verified  in  numerical 
experiments  with  a particular  area  distribution  A(x). 

Recalling  that  (4.12)  is  a scalar  model  equation  representing  the 
systems  (4.1),  (4.2),  we  find  it  interesting  to  note  that  this  2x2  system  also 
possesses  an  area  rule,  namely: 


ft  / ty+^dx  = f [(<f/2(l,t)  + <|>2(l,t))  - (<|,2(0,t)  + *2(0,t))].  (4.26) 

Under  the  assumption  that  <|>(0,t)  = <J>(l,t)  = 0;  ij)(0,t)  = ifi(l,t),  we  have 

ft  / (*  + 4>)dx  = 0.  (4.27) 

We  can  now  use  this  to  test  the  "goodness"  of  our  model  by  comparing  the  shock 
location  predicted  from  (4.25)  with  that  of  the  system,  whose  solution  is 
found  numerically.  This  comparison  is  carried  out  in  the  next  section. 

Having  concluded  the  analysis  of  the  a = 1 case,  let  us  now  return  to 
the  more  general  formulation  (4.8).  In  particular,  let  us  consider  the  case 
of  y = 1.4  (a  = .6),  corresponding  to  air.  We  next  show  how  (4.8)  may  be 
cast  in  a form  similar  to  the  "decoupled"  one  in  (4.12).  Multiply  both  sides 
of  (4.8)  by  r'(<f>)  (r'  = dr/ d<J> ) to  obtain 

A 

|£  + r |£  = .Hj>lr2l  F'(x)  = K(r)(r+  - r)(r  - r_)F'(x),  (4.28) 

<(>'(r) 


where 


(4.29) 


K(r) 


/--|^2(/l-|r2-|r)(r-/^)(r+y|) 
Hr  - | J 1 r2)(§  r + 5 J\  - | r2) 


(1  - r 


r+  / 2 


r = - 


(4.30) 


The  quantities  r_  and  r+  are  the  values  of  r which,  in  the  steady  state, 
correspond  to  Mach  numbers  of  zero  and  infinity,  respectively.  For  general 
values  of  y,  K(r),  r+ , and  r_  are  replaced  by  K(r,a),  r+(a),  and  r_(a). 
K(r,a)  will  have  the  same  structure  as  in  (4.29). 

It  is  easy  to  verify  that  K(r),  given  by  (4.29),  is  a positive,  slowly 

monotonically  decreasing  function  in  the  relevant  range  r < r < r+.  In  fact 
K(r_)  » 2K(r+)  = .309.  In  the  case  of  y = 3,  i.e.,  equation  (4.12),  r = <j> 
and  we  have  r_^_  = — r_  = 1/Z2  and  K(r)  = constant.  It  is  thus  clear  that 

the  topological  behavior  of  (4.28)  is  the  same  as  that  of  (4.12),  and  the 

<*^8um®nts  carry  over.  In  particular  the  non- unique  smooth  steady  states 
depend  on  the  initial  data  in  the  same  fashion  with  respect  to  8. 


4.1  NUMERICAL  RESULTS  FOR  QUASI  ONE-DIMENSIONAL  EQUATIONS 

Here  we  study  numerically  equations  (4.1)  and  (4.2)  for  y = 3,  namely: 


It  + fr  (f-)  - **  i (*2  _ <i>2)f'(x)  (4.3D 

ft  + (2“)  = i C^2  “ <)>2)F'(x).  (4.32) 
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The  area  of  the  dual-throat  nozzle  is  defined  by 


A(x)  = d)2  tlLz  d<2x  - lff  , 

2(1  - d)(l  - d(2x  - l)2)2 


0 < x < 1, 


(4.33) 


where  d is  a parameter  related  to  the  maximum  area  by 


a = O - d)2  + 1 
max  2(1  - d) 


(4.34) 


For  the  numerical  experiments,  we  have  used  d = 1/6  which  results  in 

= 1.1.  The  steady  state  Mach  number  distribution  is 


M(x)  = A(x)  ± /v2(x)  - 


1 , 


(4.35) 


and  the  steady  state  solution  to  (4.31)  and  (4.32)  as  a function  of  the  Mach 
number  is 

* - / 3 (1  + M)/(l  + M2)1/2  (4.36) 


♦ 


= - /3  (1  - M)/(l  + M2) 


(4.37) 


With  the  stagnation  pressure  and  density  used  as  reference  values,  the  value 
of  t|>*  is  /6. 
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4.1.1  Explicit  Form 


Equations  (4.31)  and  (4.32)  are  discretized  using  the  explicit  E-0 
scheme  given  by  equation  (2.20).  Numerical  calculations  were  performed  with 
initial  conditions  corresponding  to 


(4.38) 


which  is  equivalent  to  (4.16),  and  with 

’Kx.O)  = ^ [~^2A(x) 1 ] V2  ’ <4*39> 

or 

- Z6  (l  - »2(4%=^ |‘/2  <«.«> 

The  initial  conditions  given  by  (4.39)  correspond  to  the  exact,  steady 
solution  for  \p  while  those  given  by  (4.40)  correspond  to  conditions  for  i|/ 
consistent  with  (4.38)  and  constant  total  enthalpy,  (4.5).  The  steady  state 
reached  was  the  same  in  either  case;  therefore,  the  results  reported  here  are 
for  calculations  with  (4.40)  only. 

Figure  8 summarizes  the  numerical  results.  The  figure  compares  the 
predicted  steady  state  shock  position  as  given  by  (4.25)  for  the  model 
equation  (4.12)  and  the  computed  position  for  the  system  (4.31)  and  (4.32). 
As  is  evident  from  the  figure,  the  agreement  is  very  good. 
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4.1,2  Implicit  Form 


Equations  (4.31)  and  (4.32)  are  discretized  using  the  implicit  E-0 
scheme  given  by  equation  (2.28).  Equations  (4.38)  and  (4.40)  are  again  used 
as  initial  conditions.  The  numerical  results  are  summarized  in  Figure  9.  As 
shown  in  the  figure,  the  steady  state  shock  position  depends  on  the  Courant 
number  as  measured  by  the  parameter 

e = 100  . (4.41) 
At 

For  values  of  e >_  10  the  steady  state  shock  position  is  the  same  as  that 
predicted  by  the  explicit  form.  For  values  of  e < 10  (large  At),  the  steady 
state  shock  position  bifurcates  at  certain  values  of  0. 
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Figure  9.  Predicted  steady  state  shock  position  given  by  (4.25)  for 

equation  (4.12)  and  computed  position  for  system  (4.31)  and 
(4.32)  with  initial  conditions  (4.38)  and  (4.40)  using  an 
implicit  scheme. 
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CONCLUSIONS 


In  this  paper  we  analyzed  several  model  equations  for  characteristic 
initial  boundary  value  problems  and  examined  numerically  these  as  well  as  the 
quasi-one-dimens ional  isentropic  Euler  equations  of  gas  dynamics. 

We  showed  that  because  of  the  characteristic  nature  of  the  boundary 
conditions  the  resulting  steady  states,  whether  smooth  or  discontinuous, 
depend  on  the  initial  data.  Different  initial  conditions  may  yield  different 
steady  states.  We  also  gave  an  example  (see  Section  2)  of  solution  to  the 
steady  state  equation  which  cannot  evolve  from  the  initial  data.  Thus  from 
the  point  of  view  of  the  time-dependent  equation,  we  find  there  are  no  non- 
unique  steady  states • 

Another  conclusion  that  one  may  draw  is  that  in  order  to  have  complete 
confidence  in  the  results,  numerical  schemes  for  characteristic  initial 
boundary  value  problems  should  be  time  consistent  and  employ  only  suitable 
boundary  conditions.  Thus  we  have  shown  that  implicit  methods,  even  for 
finite  Courant  numbers,  may  yield  solutions  which  are  piecewise  combinations 
of  non-unique  solutions  of  the  steady  state  equations.  In  fact,  such 
numerically  implicit  algorithms  may  converge  to  solutions  which  also  include 
parts  of  unstable  steady  states. 
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ABSTRACT 

Let  u(x,t)  be  a weak  solution  of  the  Euler  equations,  governing  the 

inviscid  polytropic  gas  dynamics;  in  addition,  u(x,t)  is  assumed  to  respect 

the  usual  entropy  conditions  connected  with  the  conservative  Euler 

equations.  We  show  that  such  entropy  solutions  of  the  gas  dynamics  equations 

satisfy  a minimum  entropy  principle,  namely,  that  the  spatial  minimum  of  their 

specific  entropy,  Ess  inf_s(u(x, t)) , is  an  increasing  function  of  time.  This 

x 

principle  equally  applies  to  discrete  approximations  of  the  Euler  equations 
such  as  the  Godunov- type  and  Lax-Friedrichs  schemes.  Our  derivation  of  this 
minimum  principle  makes  use  of  the  fact  that  there  is  a family  of  generalized 
entropy  functions  connected  with  the  conservative  Euler  equations. 
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1.  INTRODUCTION 


Many  phenomena  in  continuum  mechanics  are  modeled  by  hyperbolic  systems  of 
conservation  laws 

a,  d 

+ l — = o,  (x  = (x^***  ,xd),t)e  R*[0,~),  (1.1) 

k=  1 k 

where  = f^^(u)  = (f are  smooth  nonlinear  flux  mappings 

T 

of  the  N-vector  of  conservative  variables  u=  u(x,t)  = (u^,**«,u^)  • 
Friedrichs  and  Lax  [3]  have  observed  that  the  hyperbolic  nature  of  such  models 
is  revealed  by  the  property  of  most  of  those  systems  being  endowed  with  a 
generalized 


Entropy  Function:  A smooth  convex  mapping  U(u)  augumented  with  entropy  flux 
mappings  F = F(u)  = (F^  ^ (u)  , • • • ,F^^  (u)  ) , such  that  the  following 
compatibility  relations  hold 


jjT  f(k)  a p(k)T 

u u u 1 


k = 1 ,2 , • * • ,d. 


(1.2) 


T 

Multiplying  (1.1)  by  and  employing  (1.2),  one  arrives  at  an  equivalent 
formulation  of  the  compatibility  relations  (1.2),  namely,  that  under  the 
smooth  regime  we  have  on  top  of  (1.1)  the  additional  conservation  of  entropy 


3F 


00 


M+  f _ 

3t  k-1  8xk 


0. 


(1.3) 


(k) 

Owing  to  the  nonlinearity  of  the  fluxes  f (a),  solutions  of  (1.1)  may 
develop  singularities  at  a finite  time  after  which  one  must  admit  weak 
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solutions,  i.e.. 


those  derived  directly  from  the  underlying  integral 


conservative  equations.  Considering  (1.1)  as  a strong  limit  of  the 
regularized  problem. 


— + I 

ii 


af 


(k) 


,xk 


d a2 

v d U 

w ^ ~~2 
k=i  ax. 


U + 0, 


(1.4) 

y 


then  following  Lax  [9]  and  Krushkov  [8],  we  postulate  as  an  admissibility 
criterion  for  such  limit  solutions  an  entropy  stability  condition  which 
manifests  itself  in  terms  of  an 


Entropy  Inequality:  We  have,  in  the  sense  of  distributions, 


3U  5 

9t  lil  3*k  - 


(1.5) 


Weak  solutions  of  (1.1),  which  in  addition  satisfy  the  inequality  (1.5) 
for  all  entropy  pairs  (U,f)  connected  with  that  system,  are  called  entropy 
solutions .d ) Having  a (weakly)  nonpositive  quantity  on  the  L.H.S.  of  (1.5) 
is  thus  a consequence  of  viewing  these  entropy  solutions  as  limits  of 
vanishing  dissipativity  mechanisms.  In  particular,  the  inequality  (1.5) 
implies  that  the  total  entropy  in  the  domain  decreases  in  time  (we  assume 
entropy  outflux  through  the  boundaries) 


/_  U(u(x,t))dx  _<  0. 
x 


(1.6) 


^ ^Krushkov  [8,  p.  241]  has  termed  such  solutions  simply  as  generalized 
solutions.  " 
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In  this  paper,  we  consider  entropy  solutions. 


u = (p ,*,E)T  ( 1.7a) 

of  the  Euler  equations.  These  equations  govern  the  inviscid  poly tropic  gas 

dynamics , asserting  the  conservation  of  the  density  p , the  momentum 
T in 

m = (m^n^,!^)  , and  the  energy  E.  Let  q h — denote  the  velocity  field  of 
such  motion.  Then,  expressed  in  terms  of  the  pressure,  p, 

2 

p = (y-1)*[E  -V2  *p|q|  ]»  Y = adiabatic  exponent,  (1.7b) 

the  corresponding  fluxes  in  this  case  are  given  by^2^ 

= (mk»<lk*®  + p»e^k\qk(E  + p))T,  k = 1,2,3.  (1.7c) 

The  main  result  of  this  paper  asserts  that  entropy  solutions  of  Euler 
equations  satisfy  the  following 

Minimum  Principle:  Let  u = u(x,t)  be  an  entropy  solution  of  the  gas 

dynamics  equations  (1.7)  and  let 

S(x,t)  = S(u(x,t))  = ln(pp-Y)  (1.8) 


^2^With  e^k) 


denoting  the  unit  Cartesian  vectors 


103 


denote  the  specific  entropy  of  such  solution.  Then  the  following  estimate 
holds 

Ess  Inf  S(x,t)  > Ess  inf  S(x,t  = 0).  (1.9) 

| x | <R  I x I <R+t • q 

Here  <Jmax  stands  for  the  maximal  speed  |q|  in  the  domain. 

The  proof  of  this  assertion  is  provided  in  Section  3 below.  Prior  to  that 

we  elaborate  in  Section  2 on  the  entropy  inequality  connected  with  the  gas 

dynamics  equations.  In  particular,  Harten  [5]  has  shown  that  there  exists  a 

whole  family  of  entropy  pairs  associated  with  these  equations,  a fact  which  is 

essential  in  our  derivation  of  the  minimum  principle. 

As  an  immediate  consequence  of  the  minimum  principle,  we  conclude  that 

Ess  inf_S(x,t)  is  an  increasing  function  of  t for  every  entropy  solution  of 
x 

(1.7).  The  following  argument  sheds  additional  light  on  this  conclusion  in 

the  case  of  a piecewise-smooth  flow.  To  this  end,  an  arbitrary  particle 

currently  located  at  (x,t)  is  traced  backwards  in  time  into  its  initial 

position  at  t = 0.  Since  the  specific  entropy  of  such  particle  remains 

constant  along  the  particle  path — except  for  its  decrease  when  crossing 

backwards  shock  waves,  it  follows  that  its  value  S(x,t)  is  greater  or  equal 

than  that  of  the  initial  spatial  minimum  Ess  inf  S(x,t  = 0),  as  asserted. 

x 

In  contrast  to  the  above  'Lagrangian'  argument,  the  derivation  of  the  minimum 
principle  outlined  below,  is  purely  an  'Eulerian'  one.  It  enables  us  to  relax 
the  regularity  assumption  on  the  flow,  and — since  we  do  not  follow  the 
characteristics,  it  equally  applies  to  discrete  approximations  of  the  Euler 
equations. 
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In  Section  4 we  consider  approximate  solutions  of  the  Euler 

equations,  w(x^,t),  which  respect  the  entropy  decrease  estimate  (1.6), 

T U(w(x  ,t  + At))Ax  < y U(w(x  , t) )Ax  . (1.10) 

V V — V V 

We  note  that  such  approximate  solutions  are  obtained  by  entropy  stable 
schemes  satisfying  the  cell  entropy  inequality 

U(w(xv,t  + At))  < U(w(xv,t))  + 1 -L-  [F^)1/ F<*\y .],  (1.11) 

k=l  Ax  z z 

v 

e.g.,  the  Godunov-type  and  Lax-Friedrichs  schemes  [6].  We  have 

Minimum  Principle:  Let  w(x^,t)  be  an  approximate  solution  of  the  gas 

dynamics  equations  (1.8)  and  let 


S(xv,t)  = S(w(xv>t))  = liKpp^)  (1.12) 

denote  the  specific  entropy  of  such  solution.  Assume  that  its  total  entropy 
decreases  In  time,  (1.10).  Then  the  following  estimate  holds 


S(x,t  + At)  > Min[S(x  ,t)] . (1.13) 

v 

In  the  case  of  entropy  stable  schemes,  (1.11),  a more  precise  estimate  is 
obtained  which  takes  into  account  the  support  of  the  schemes'  stencil. 

The  inequality  (1.13)  leads  to  an  a'priori  polntwlse  estimate  on  the 
approximate  solution  w(x,t).  Such  pointwise  estimates  play  an  essential  role 
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with  regard  to  question  of  the  convergence  of  entropy  stable  schemes.  In 
particular,  DiPerna  [2,  Section  7]  has  recently  shown  that  in  certain  cases, 
such  (two-sided)  estimates  are  sufficient  in  order  to  guarantee  the 
convergence  of  such  schemes . 


2.  GENERALIZED  ENTROPY  FUNCTIONS  OF  THE  EULER  EQUATIONS 
We  consider  the  Euler  equations  for  polytropic  gas 


9_ 

9 1 1 


'P 

m 

E 


3 a 

+ l w- 

k-1  9Xk 


m,. 


00 


qkm  + pe 
qk(E  + p) 


= 0. 


(2.1) 


It  is  well-known,  e.g.,  [1],  that  for  all  smooth  solutions  of  (2.1)  the 

specific  entropy^) 

S(x,t)  = ln(ppwy), 


remains  constant  along  streamlines,  i.e., 


PS  _ 9S  2 9S_ 

Dt  ' 8t  kti  qk  8xk  " 


0. 


(2.2a) 


Let  h(S)  be  an  arbitrary  smooth  function  of  S.  Multiplying  (2.2a)  by 
ph'(S)  — prime  denoting  S-dif ferentiation,  we  find 


(3) 


After  normalization,  taking  the  specific  heat  constant  to  be 


1. 
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„ 3h(S)  , v _ 3MS)  n 


Adding  this  to  the  continuity  equation  which  is  premultiplied  by  h(S), 


. 3 3m, 

&«»♦  1 


(2.2b) 


k=l  k 


we  obtain  after  changing  sign,  a conservative  entropy  equation  like  (1.3) 
which  reads  [5] 


8 [-ph(S)]  + l [-m^S)]  = 0. 

k=l  k 


3t 


(2.3) 


In  order  to  comply  with  the  further  requirement  of  being  a generalized  entropy 

function,  U(u)  = -ph(S)  has  to  be  a convex  function  of  the  conservative 

T 

variables  u *»  (p  ,*,E)  . A straightforward  computation  carried  out  by  Harten 

[5,  Section  2]  in  the  two-dimensional  case  shows  that  the  Hessian  U is 

uu 

positive  definite  if  and  only  if 


p[h'(S)  - yh"(S)]  > 0. 

Excluding  negative  densities  we  may  summarize  that  there  exists  a family  of 
(generalized)  entropy  pairs  (U,!1)  associated  with  Euler  equations  (2.1), 


U(u)  = -ph(S) , F(k)(u)  = -mkh(S)  k = 1,2,3, 


(2.4a) 


generated  by  the  smooth  increasing  functions  h(S)  which  satisfy 


107 


h'(S)  - -yh"(S)  > 0. 


(2.4b) 


3.  A MINIMUM  ENTROPY  PRINCIPLE 

T 

Let  u = (p,m,E)  be  an  entropy  solution  of  the  gas  dynamics  equations 

(2.1).  Such  a solution  is  characterized  by  the  entropy  inequality  (1.7) 


au(u)  y 3F(k)(u)  , 

8t  R-l  ‘ 


(3.1) 


which  holds  for  all  entropy  pairs  (U,f)  connected  with  the  equations. 

To  derive  a minimum  principle,  we  shall  make  use  of  an  argument  due  to  Lax 
[9,  Section  3].  We  begin  with 


Lemma  3.1:  Let  u be  an  entropy  solution  of  the  gas  dynamics  equations 

(2.1).  Then  for  all  nonpositive  smooth  increasing  functions  h(S)  satisfying 
(3.2b),  we  have 

/ p(x,t)*h(S(x,t))dx  > / p(x,0)*h(S(x,0))dx.  (3.3) 

|x|<R  |xl<R+fq 

- ||=  nmax 

Here  9max  denotes  the  maximal  speed  |q|  in  the  domain. 

Proof : As  in  [10,  Theorem  4.1]  we  integrate  the  entropy  inequality  (3.2a) 

over  the  truncated  cone  C - { Ixl  < R + (t  - x)*q  |0  < x < t} ; if  we  let 
(ng,n)  denote  the  unit  outward  normal,  then  by  Green's  theorem 
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(3.4) 


/ ph(S)« 
9C 


n0  + .1,  qknk 
k=l 


ax  > o. 


The  integrals  over  the  top  and  bottom  surfaces  give  us  the  difference  between 
the  left  and  right-hand  sides  in  (3.3)  and  by  (3.4)  this  difference  is  bounded 
from  below  by 


-J  ph(S)* 
mantle 


+ 


Vk 


Bx. 


The  result  follows  upon  showing  that  the  last  quantity  is  nonnegative. 
Indeed,  since  by  assumption  -ph(S)  > 0,  this  is  the  same  thing  as 


on  the  mantle  we  have 


+ I %\  > 0; 

k=l  x x 


(n0>n)  = 


A ~ 


qmax’  ixir 


"max 


and  hence 


”o  + A Vk  ■ . 1 , (' 

/l  + q X 


3 

l 

k=l 


3 qkxk\ 

q + ^ j > 

maX  k=l  Ixl / = 


"max 


yr 


77= 

nmax 


max 


- I 


3 hi 


> 0 


k=l 


as  asserted. 


The  discussion  in  Lemma  3.1  was  restricted  to  smooth  function  h(S);  by 
passing  to  the  limit,  its  conclusion  (3.3)  follows  for  any  nonpositive 
nondecreasing  function  h(S)  satisfying  (3.2b),  whether  smooth  or  not. 
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To  derive  the  minimum  entropy  principle,  we  now  make  a special  choice  of 
such  function,  h(S),  given  by 

h(S)  - Min[  S - S0,0],  SQ  = Ess  inf  S(x,0).  (3.5) 

x| <R+t«q 
= max 

The  nonpositive  function  h(S)  is  a nondecreasing  concave  one,  hence 
admissible  by  (3.2b),  and  consequently  (3.3)  applies 


/ p(x,t)*Min[  S(x,t)  - SQ,0]dx  _> 
|x|<R 


(3.6) 


/ p(x,0)»Min[ S(x,0)  - SQ,0]dx. 

|x|<R+t»q 
1 ■ — nmax 

Now,  by  the  choice  of  Sq,  the  integral  on  the  right  of  (3.6)  vanishes  since 
Min[ S(x,0)-Sq,0]  does.  The  inequality  (3.6)  then  tells  us  that  the  integral 
on  the  left  is  also  nonnegative.  But  since  the  integrand  on  the  left  is  by 
definition  nonpositive,  this  can  be  the  case  provided  this  integrand  vanishes 
almost  everywhere;  that  is,  we  have  for  almost  all  x,  |x|  < R 


and  (1.9)  follows. 


S(x,t)  > SQ 


Ess  inf 
| x | <R+t*q 


max 


S(x, t=0) 


The  minimum  entropy  principle  was  deduced  from  the  entropy  inequality 
(3.2),  which  in  turn  was  postulated  based  on  the  formal  regularization 
introduced  in  (1.4).  In  general,  other  regularizations  equally  apply;  in 
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particular,  Euler  equations  are  usually  sought  as  the  vanishing  viscosity 


limit  of  the  Navier-Stokes  equations  (here  we  take  for  simplicity  the  one- 
dimensional case)^) 


■p' 

m 

, 3 

in 

+ 3 X 

qm  + p 

_E_ 

q(E  + p) 

0 ' 

3x  ’ 


U + 0. 


(3.7) 


Do  the  (generalized)  entropy  inequalities  (3.2)  remain  valid  on  the  basis  of 
such  limit?  To  answer  this  question  we  first  note  that  if  U(u)  is  any 
entropy  function,  then  thanks  to  its  convexity  the  mapping  u -*■  v = Uu  is 
one-to-one,  and  hence  one  can  make  the  change  of  variables  u = u(v).  Harten 
[5]  has  shown  that  such  change  of  variables  by  each  member  of  the  family  of 
entropy  functions  (2.4)  puts  the  viscosity  terras  on  the  right  of  (3.7)  into  a 
negative  semidefinite  form.  This  makes  apparent  the  dissipative  effect  of 

o 

these  viscosity  terms.  Indeed,  if  T = c • E - V2  • | q | denotes  the  absolute 
temperature,  then  direct  manipulation  of  (3.7)  yields,  e.g.,  [1,  Section  63], 
[12,  Section  6.10], 

q2 

1^  [ph(S)]  + [mh(S)]  = yh(S)  ^ , (3.8) 


from  which  we  recover  the  entropy  inequality  (3.2a)  for  all  smooth  increasing 
functions  h(S).  We  note  that  the  convexity  condition  was  not  assumed  in  this 


^^With  y combining  the  two  viscosity  coefficients  in  the  general  Navier- 
Stokes  equations. 
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case.  The  merit  of  using  the  convexity  condition,  however,  is  that  it  enables 
us  to  deal  with  more  general  artificial  viscosity  terras,  other  than  those 
appearing  in  the  Navier-Stokes  equations.  Such  artificial  viscosity  terras  are 
frequently  encountered  in  finite-difference  approximations  to  the  Euler 
equations;  a specific  example  of  this  kind  is  studied  in  the  next  section. 

Finally  we  would  like  to  remark  on  the  previously  mentioned  Navier-Stokes 
equations.  Our  discussion  above  took  into  account  only  the  viscosity 
contribution,  neglecting  heat  conduction.  Hughes,  et  al.,  [7]  have  shown  that 
when  the  heat  flux  is  also  added,  compare  (3.7), 

0 

° (3-9> 

3T 
8x 

with  k denoting  the  heat  conductivity  constant,  then  only  the  'physical' 
entropy,  U(u)  = -pS  survives  as  the  one  which  puts  the  additional  heat  flux 
into  a symmetric  negative-definite  form.  We  would  like  to  note  in  this 
connection  the  difference  limit  behavior  of  the  Navier-Stokes  flows  depending 
on  the  viscosity  and  heat  conductivity;  Gilbarg  [4]  has  shown  that  as  k + 0 
keeping  y fixed,  we  are  led  to  a continuous  thermally  nonconducting  shock 
layer,  whereas  for  y 0 with  k fixed  the  convergence  is  to  a (generally) 
discontinuous  nonviscous  shock  layer.  Consequently,  the  viscosity  rather  than 
the  heat  flux  should  play  the  major  rule  in  an  appropriate  regularization 
model  for  the  Euler  equations. 


112 


4.  DISCRETE  APPROXIMATIONS  OF  THE  EULER  EQUATIONS 

In  this  section  we  consider  approximate  solutions  of  the  Euler 
equations,  w(xv>t),  whose  total  entropy  decreases  in  time,  compare  (1.10) 

£vU(w(xv  ,t  ■+  At))Axy  _<  ^vU(w(xv,t))Axv.  (4.1) 

Estimate  (4.1)  holds  for  all  entropy  functions  U = -ph(s)  in  (2.4).  By 
passing  to  the  limit,  this  applies  to  our  previous  choice  of  the  function 
h(s)  in  (3.5) 

h(s)  = Min[S  - SQ,0],  (4.2a) 

this  time  with  a constant  Sq  which  is  taken  to  be 

SQ  = Min  S(w(x  ,t)).  (4.2b) 

v 

By  our  choice  of  Sq,  we  have  U(w(xv,t))  = 0.  The  inequality  (4.1)  tells  us 
that  the  left-hand  side  is  therefore,  nonnegative;  consequently 

S(x,t  + At)  " SQ  > h(S(x,t  + At))  > 0 


and  (1.13)  follows. 

Approximate  solutions  which  fulfill  the  required  estimate  (4.1)  can  be 
obtained  by  entropy  stable  schemes  satisfying  the  cell  entropy  inequality 
(1.11) 


U(w(x  ,t  + At))  < 


U(w(xv,t)) 


d 

l 

k=l 


1 r -p(k)  v(k) 
~ Lv+1/2  Fv-  V2 


Ax 


]• 


(4.3) 
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Examples  of  such  entropy  stable  schemes  include  the  Godunov-type  and  Lax- 
Friedrichs  schemes,  e.g.,  [ 6 ] • A more  precise  minimum  principle  follows  in 
these  cases,  taking  into  account  the  support  of  the  schemes'  stencil.  In 
particular,  the  (one-dimensional)  Godunov  scheme  results  from  averaging  of  two 
neighboring  Riemann  problems  [6],  each  of  which  satisfies  (1.9).  Consequently 
we  have  the 


Minimum  Principle  (of  the  Godunov  scheme):  Let  w(x^,t)  the  Godunov 

approximate  solution  to  the  Euler  equations  (2*1). Assume  that  the 

appropriate  CFL  condition  is  met.  Then  the  following  estimate  holds 


S(w(x  ,t  + At))  Min  s(w(x. ,t)).  (4.4) 

v v-l<j<v+l  2 

Since  the  Lax-Friedrichs  scheme  coincides  with  a staggered  Godunov's  solver, 
the  same  conclusion,  (4.4),  holds.  Another  way  to  see  this  is  outlined  below; 
it  makes  no  reference  to  Riemann's  solution  and  can  be  generalized  to  the 
multidimensional  problem. 

To  this  end,  we  approximate  the  (for  simplicity — one-dimensional)  Euler 
equations  with  the  Lax-Friedrichs  scheme 


w(  x^  , t + At) 


2 


,t)  + w(x 


v— 1 


X_ 

2 


X 


At 

Ax 


(4.5) 


We  remark  that  the  Lax-Friedrichs  scheme  can  be  derived  from  center 
differencing  of  the  regularization  model  (1.4)  . Lax  has  shown  [9,  Theorem 
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1.2]  that  if  X = — is  sufficiently  small,  then  solutions  of  this  difference 
scheme  satisfy  the  following  cell  entropy  inequality 


U(w(xy  ,t  + At))  <C 


+ u(w(xv_1*t)) 
2 ■ 


- J [F(w(xv+1,t))  - F(w(xv_1,t))] 


(4.6) 


for  all  entropy  pairs  (U,F)  = (-ph(S)  ,-mh(S))  in  (2.4).  by  passing  to  the 
limit,  this  applies  to  our  previous  choice  of  the  function  h(S)  in  (3.5) 


h(S)  = Min[S  - Sq.O], 

this  time,  with  a contant  Sq  which  is  taken  to  be 


(4.7a) 


SQ  ^(Xy  + J »*•)  » ^(X^_^,t)]. 


(4.7b) 


The  inequality  (4.6)  now  reads 
p(xy,t  + At)*h(S(xv,t  + At))  ^ 


1 + Xq(x  ,t) 

2 P(xv-l»t)*h(s(xv_1»t)) 

1 " 

+ 2 P(xv+i,t).h(s(xv+1,t)) 


(4.8) 


By  our  choice  of  the  funtion  h(S)  in  (4.7),  we  have  h(S(xv±1,t))  = 0.  The 
inequality  (4.8)  tells  us  that  the  left-hand  side  is  therefore  nonnegative; 
consequently 
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0 £ h(S(xv,t  + At)}  < S(x,t  + At)  - Sq 
and  the  following  minimum  principle  follows 


S(w(xy>t  + At))  _>  Min  s(w(xv±1  *fc))  • 
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ABSTRACT 

A multidomain  Chebyshev  spectral  collocation  method  for  solving  hyperbolic 
partial  differential  equations  has  been  developed.  Though  spectral  methods 
are  global  methods,  an  attractive  idea  is  to  break  a computational  domain  into 
several  subdomains,  and  a way  to  handle  the  interfaces  is  described.  The 
multidomain  approach  offers  advantages  over  the  use  of  a single  Chebyshev 
grid.  It  allows  complex  geometries  to  be  covered,  and  local  refinement  can  be 
used  to  resolve  important  features.  For  steady-state  problems  it  reduces  the 
stiffness  associated  with  the  use  of  explicit  time  integration  as  a relaxation 
scheme.  Furthermore,  the  proposed  method  remains  spectrally  accurate. 
Results  showing  performance  of  the  method  on  one-  dimensional  linear  models 
and  one-  and  two-dimensional  nonlinear  gas-dynamics  problems  are  presented. 


Research  was  supported  by  the  National  Aeronaiitics  and  Space  Administration 
under  NASA  Contract  Nos.  NAS1-17070  and  NAS1-18107  while  the  author  was  in 
residence  at  ICASE,  NASA  Langley  Research  Center,  Hampton,  VA  23665-5225. 


1.  INTRODUCTION 


In  this  paper  we  address  the  problem  of  efficiently  computing  Chebyshev 
spectral  collocation  approximations  to  quasilinear  hyperbolic  systems  of  the 
form 

Qt  + ACQ)^  + B(Q)Qy  = 0 x,y  DCR2,  t > 0 (1) 

with  appropriate  boundary  and  initial  conditions.  Here,  Q is  an  m-vector 
and  A and  B are  mxm  matrices.  This  system  is  hyperbolic  if  for  any 
constants  kj  and  k£  the  matrix  T = k^  A + k2  B has  only  real  eigenvalues 
and  there  exists  a similarity  transformation  matrix,  P,  such  that  PTP  * = A 
is  a real  diagonal  matrix. 

In  particular,  we  are  interested  in  the  solution  of  the  Euler  equations  of 
gas  dynamics  which  form  a system  of  this  type.  The  use  of  the  nonconservation 
form  is  justified  for  problems  in  which  shocks  are  fitted  and  in  this 
situation  spectral  methods  work  well  [ 1 ] . Problems  of  the  type  presented  in 
Ref.  [1]  provide  the  motivation  for  what  follows. 

The  typical  Chebyshev  spectral  collocation  procedure  for  the  solution  of 
the  system  (1)  is  described  in  several  reviews  such  as  those  of  Gottlieb, 
Hussaini,  and  Orszag  [2],  and  Hussaini,  Salas,  and  Zang  [3].  First,  the 
domain  of  interest  is  mapped  onto  the  square  D'  = [-1 ,1 ]x[-l ,1]  and  an 
(N+M)  x (M+l)  point  mesh  is  generated  with  the  collocation  points  defined  by 

x±  « - cos(ir/N)  i = 0,1,«*»,N 

y^  = “ cos( jir/H)  j = 0, !,•••, M.  (2) 
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Mesh  point  values  of  Q,  designated  by  Q^j » are  associated  with  each  of  the 
collocation  points  (x^,yj).  A global  Chebyshev  interpolant  of  order  N in 
the  x direction  and  order  M in  the  y direction  is  then  put  through  the 
mesh  point  values 


Qp(x,y) 


N,M 

I 


n,m=0 


a 

nm 


T (x)T  (y). 
n m 


(3) 


Approximations  to  the  derivatives  at  the  collocation  points  are  computed 
by  differentiating  the  interpolant  and  evaluating  the  resulting  polynomial  at 
the  collocation  points.  The  computation  of  the  derivatives  can  be 
accomplished  in  one  of  two  ways  (see  Gottlieb,  et  al.,  [2]):  The  first  is  to 
take  advantage  of  the  fact  that  the  sums  for  both  the  interpolant  and  its 
derivative  reduce  to  cosine  sums  at  the  chosen  collocation  points.  For 
example 


where 


dx 


N,M 

l 


n,m=0 


a r(x)T  (y) 
nm  n m 


N,M 

l b T (x)T  (y) 
~ nm  n m 
n,m=0 


(4) 


bM 

Nm 


= 0, 


bM  . = 2Na 

N-l,m  nm 


(5) 


and 


Cnbnm  = bn+2,m  + 2(n  + 1)an+l,m  f°r  0 < n < N “ 2. 


The  constant  cn  is  defined  as  cn  = 2 for  n = 0,N  and  cn  = 1 
otherwise.  The  advantage  of  this  form  is  that  a fast  cosine  transform  can 
compute  the  derivatives  along  each  y line  in  0(N  log  N)  operations. 
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The  other  approach  to  computing  the  derivatives  is  to  write  the 
differentiation  operation  as  the  product  of  a differentiation  matrix  and  the  a 
vector  of  the  s.  For  example,  along  each  y line  the  x derivative  is 

dQ 

(d/)j  - D(Qp)j  (6) 

where  (Qp)j  = [ ^0, j ^l,j  ***  ^N,j^  and  tlie  elements  the  matrix  D are 

defined  in  Gottlieb  et  al.,  [2].  The  amount  of  work  with  this  procedure  is 
2 

of  0(N  ).  What  one  loses  in  efficiency  one  gains  as  flexibility  in  the 
number  of  mesh  points  that  can  be  used  in  each  direction  without  adding 
storage. 

No  matter  which  way  the  spatial  derivatives  are  computed,  it  is  Important 
to  note  that  computing  the  Chebyshev  derivative  approximations  requires  only 
mesh  point  values.  Derivatives  at  the  end  points  require  only  points  interior 
to  the  mesh  so  no  extra  procedure  is  required  to  compute  derivatives  at 
boundaries. 

Once  the  spatial  derivatives  are  approximated,  what  results  is  a system  of 
ordinary  differential  equations  in  time  for  the  variation  of  the  solution  at 
each  collocation  point  (Method  of  Lines).  Because  the  differentiation  matrix 
is  full,  explicit  methods  are  typically  used  to  integrate  the  semi-discrete 
equations.  In  this  paper,  all  time  integrations  will  be  performed  with  a 
fourth-order  Runge-Kutta  method. 

The  advantage  of  using  this  spectral  method  to  solve  (1)  is  that  for 

oo 

solutions  which  are  C (D) , the  accuracy  is  better  than  any  polynomial  order 
(Canuto  and  Quarteroni,  [4]).  This  is  usually  called  "spectral  accuracy"  and 
asymptotic  behavior  can  be  observed  if  there  are  enough  grid  points  to 
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adequately  resolve  the  solution.  It  is  thus  possible  to  compute  to  a given 
spatial  accuracy  with  fewer  grid  points  than  required  by  typical  low-order 
finite  difference  approximations. 

Balancing  the  high  accuracy  of  the  spectral  method,  however,  are  some 
major  disadvantages  of  the  typical  Chebyshev  collocation  approach: 

(1)  It  may  not  be  easy  or  even  possible  to  map  D -»•  D"  globally. 

(2)  The  collocation  point  distribution  is  global  and  predetermined.  Local 
refinement  of  the  mesh  is  not  possible. 

(3)  The  points  are  concentrated  near  the  boundaries  where  they  are 
typically  not  needed  for  hyperbolic  problems. 

(4)  If  explicit  time  integration  is  used  the  time  step  restriction  in  one 
dimension  is  proportional  to  1/N  . 

(5)  For  complete  flexibility  in  the  number  of  mesh  points  which  can  be 
used,  the  derivatives  cost  of  0(N  ) in  each  direction. 

These  problems  can  be  reduced  significantly  by  breaking  up  the  region  D 
into  several  subdomains  each  of  which  has  its  own  Chebyshev  grid.  With  a 

stable  and  efficient  method  for  computing  the  interfaces,  the  advantages  of 
such  an  approach  would  be: 

(1)  Complicated  geometries  can  be  covered. 

(2)  Points  can  be  distributed  with  some  flexibility;  local  refinement  is 
possible. 

(3)  In  one  dimension,  with  N points  and  K subdomains,  the  time  step 

2 

restriction  increases  to  At  « K/N  . 

(4)  Derivative  evaluation  work  with  matrix  multiplication  decreases  to 
K(N/K)2  or  1/K  that  of  a single  grid. 
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The  idea  of  breaking  up  the  computational  domain  into  subdomains  each  with 
a different  grid  is  not  new.  For  finite  difference  methods  this  is  a 
currently  popular  approach  (e.g.,  [5]).  For  spectral  methods,  however, 
previous  applications  have  been  limited  to  elliptic  and  parabolic  problems. 
Orszag  [6]  first  applied  such  a technique  to  solve  elliptic  problems.  He 
enforced  continuity  of  the  function  and  its  first  derivative  as  the  interface 
condition.  Metivet  and  Morchoisne  [7]  and  later,  Morchoisne  [8]  computed 
multidomain  solutions  to  the  Navier-Stokes  equations.  Recently,  Patera  [9] 
and  Korczak  and  Patera  [10]  have  been  using  a spectral  element  method  to  solve 
the  incompressible  Navier-Stokes  equations.  Their  method  is  very  similar  to 
the  p finite-element  methods  developed  by  Babuska  (see  [10])  but  uses 
Chebyshev  interpolants.  The  treatment  of  the  convective  terms,  however,  does 
not  lend  itself  to  purely  convective  problems.  For  these  problems,  we 
describe  the  method  below. 


2.  MULTIDOMAIN  APPROACH 

In  this  paper,  we  will  break  up  the  physical  domain,  D,  into  K 
subdomains  which  do  not  overlap  except  for  the  common  boundary  points. 

Figure  1 shows  a rectangular  two-dimensional  example  of  the  situation  with 
four  subdomains.  Each  of  the  are  mapped  onto  a square  [-1 , 1 ] x [-1 , 1 ] . 

Spatial  approximations  at  interior  points  of  each  subdomain  are  computed  in 
the  usual  way.  Across  an  interface,  however,  there  are  two  values  of  the 
normal  derivative.  For  example,  at  the  y coordinate  line  interface  between 
and  in  Figure  1,  derivative  approximations  are  available  from  the 

left  and  from  the  right.  The  problem  is  to  choose  properly  information  from 
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the  right  and  the  left  to  give  a stable  and  consistent  approximation  to  the 
differential  equation  at  the  interface. 

Before  discussing  a multidomain  method  for  the  boundary  value  problem  (1), 
we  will  first  examine  the  one-dimensional  case.  In  one  dimension,  we  seek 
interface  algorithms  of  the  semidiscrete  form 


(7) 


where  denotes  the  value  of  Q at  an  interface  and  the  derivatives 
superscripted  with  L and  R denote  the  two  spectral  approximations  computed 
in  the  left  and  right,  respectively.  For  consistency,  we  require  that 


AL  + AR  = A 


(8) 


and  for  efficiency  we  want  AR  and  AR  to  be  computed  with  little  more  work 
than  is  required  for  the  computation  of  A itself. 

To  generate  the  coefficient  matrices,  consider  first  the  linear  scalar 
hyperbolic  equation 

ut  + Aux  =0  X > 0.  (9) 

Because  the  equation  is  hyperbolic,  it  is  clear  that  the  common  interface 
point  should  depend  only  on  information  propagated  from  the  left.  Thus,  the 
approximation  should  be 


8u 


I 


at 


+ 


(10) 
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This  is,  of  course,  just  upwind  differencing  at  the  interface  and  is 
equivalent  to  the  way  Gottlieb  and  Orszag  [11]  handled  a tau  approximation  to 
equation  (9)*  To  simplify  the  computational  logic  to  include  cases  where  the 
coefficient,  X,  is  of  either  sign,  the  approximation  (10)  can  be  written  as 

fr  + 1/2(x  + |x|)  f^  + V2(x  - |x| ) o.  (11) 

If  we  now  consider  that  this  equation  is  a single  component  of  a 

diagonalized  system,  where  the  diagonal  matrix 

= P-1  AP, 

we  can  write  the  system  as 

|2-+V2(A  + |A|)  |2-+V2(A  - |A|)  |^-  0 (12) 

where  | A | = P | A | P Formally,  this  is  nothing  more  than  the  method  of 

characteristics  in  one  dimension. 

We  now  propose  to  avoid  the  computation  of  the  matrix  absolute  value  by 
approximating  it  with  a diagonal  matrix 

|A|  - PX*  IP-1  = X*  I (13) 

* 

where  X is  chosen  to  lie  between  the  largest  and  smallest  elements  of  | A | . 
The  boundary  scheme  is  now  of  the  form  of  Eq.  (7)  with 
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(14) 


al  = l/2  (a  + a*  i)  ar  = y2  <A  “ X*  I). 

This  choice  of  coefficient  matrices  always  has  proper  upwind  dominance  on 

all  of  the  characteristic  variables,  but  includes  some  downwind  influence.  To 

th 

see  this,  re-diagonalize  the  system  (7)  and  use  u as  the  nc  component  of 
the  diagonalized  system.  Then  the  approximation  to  the  method  of 
characteristics  causes  the  characteristic  variables  at  the  interface  to  be 
approximated  by 


3u 

at 


+ V2  (x 


n 


* ) 


3u 

ax 


+ V2(xn-x  ) 


liL 

3x 


R 


= 0. 


(15) 


In  fact,  this  can  be  viewed  as  the  purely  upwind  scheme  with  an  error  term: 

For  the  X >0  case, 
n ’ 


3u*  . 3u^ 

3t  An  3x 


8x 


(16) 


Thus,  we  have  the  spectrally  accurate  upwind  approximation  with  an  error 
term  proportional  to  the  difference  of  the  right  and  left  spectral 
derivatives.  If  the  solution  has  the  necessary  smoothness,  this  difference 
should  also  decay  spectrally  and  spectral  accuracy  of  the  approximation  should 
be  retained. 

We  will  study  the  stability  of  the  multidomain  method  with  the  interface 
approximation  (14)  numerically.  An  analytic  study  of  stability  is  not 
possible  at  this  time.  Stability  theory  for  Chebyshev  approximations  to 
hyperbolic  initial-boundary  value  problems  is  not  advanced  enough  to  analyze 
an  approximation  which  introduces  some  downwind  influence  at  the  interface. 
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We  consider  the  two-domain  approximation  of  the  scalar  equation  (9)  with 
the  interface  approximation  (12)  with  X = 1.  The  line  segment  [-2,2]  is 
divided  equally  into  two  domains  of  [-2,0]  on  the  left  and  [0,2]  on  the  right. 
The  semidiscrete  approximation  can  be  written  as  a system  of  ordinary 
equations  with  the  two-domain  coefficient  matrix 


0 


0 


(17) 


L R 

where  D and  D are  the  single  domain  differentiation  matrices  for  the 
left  and  the  right,  modified  to  include  the  interface  approximation.  For  this 
system  to  be  time  stable,  that  is,  the  solution  does  not  grow  unboundedly  as 
t + 00 , the  eigenvalues  of  the  coefficient  matrix  must  have  negative  real 
parts. 

Figure  2 shows  how  the  eigenvalues  change  as  X varies  when  6 points  are 
* 

used.  The  case  of  X = 0 corresponds  to  simple  averaging  and  is  clearly  not 

time  stable.  Choosing  X >0  large  enough  moves  the  eigenvalues  into  the 

left  half  of  the  complex  plane  and  the  resulting  approximation  is  time 

* 

stable.  The  case  of  X = 1 is  the  purely  upwind  case  and  the  eigenvalues 

decouple  into  two  single-domain  patterns.  If  X is  chosen  equal  to,  or 

larger  than,  the  wave  speed,  X^,  the  approximation  has  the  effect  of  adding  a 

purely  dissipative  term  to  the  equation  and  two  purely  real  eigenvalues  are 
* 

created.  If  X is  very  much  larger  than  X^,  however,  the  eigenvalues 

migrate  to  the  right  of  the  imaginary  axis.  The  range  of  X's  for  which  the 
approximation  is  stable  decreases  as  the  disparity  in  the  number  of  points 
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becomes  larger;  for  very  stiff  systems,  it  may  be  necessary  to  use  |a| 

k 

instead  of  X at  the  interface. 

It  is  interesting  to  note  that  the  reverse  situation,  where  there  is  more 

resolution  on  the  upstream  side  of  the  interface,  does  not  show  this  behavior 

* * 
and  is  stable  for  all  X ^ 0.  For  systems,  this  means  that  X should  be 

chosen  to  be  only  slightly  larger  than  the  smallest  eigenvalue  representing  a 

characteristic  moving  from  the  coarse  to  the  fine  grid.  For  systems,  this 

means  that  X should  be  chosen  to  be  only  slightly  larger  than  the  smallest 

eigenvalue  representing  a characteristic  moving  from  the  coarse  to  the  fine 

grid.  We  note,  however,  that  the  examples  on  which  the  scheme  has  been  tested 

k 

show  that  the  approximation  is  robust  over  a wide  range  of  choices  of  X . 

In  two  dimensions,  the  upwind  weighted  approximation  is  used  in  the 
direction  perpendicular  to  the  interface.  Returning  to  Figure  1,  along  x 
coordinate  lines,  the  y derivatives  are  continuous  across  the  interfaces 
except  at  corners.  At  points  not  on  the  corners,  then,  we  propose  using 


3Q 

at 


+ A 


aQ 

ax 


+ A 


R 3(T 
9x 


b|£-o 

ay 


(18) 


where  AL  and  AR  are  defined  as  above.  Along  x coordinate  interfaces, 

|i+A|o]:+BT|i+BB|i.o  09) 

at  3x  3y  8y 

where  BT  = V2  (B  + p*  I)  and  BB  = l/2  ( B - p*  I)  and  p*  is  an  approxima- 
tion to  the  eigenvalues  of  B.  At  corners,  the  weighted  approximations  are 
used  in  both  directions. 
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3.  NUMERICAL  EXAMPLES 

Numerical  experiments  on  four  model  problems  in  one  and  two  dimensions 
presented.  The  models  include  the  scalar  one— dimensional  hyperbolic 
initial  boundary  value  problem  for  a travelling  Gaussian  pulse,  a linear 
system  in  one  dimension,  quasi— one— dimensional  flow  in  a converging— diverging 
nozzle,  and  the  transonic  Ringleb  problem.  The  Ringleb  flow  models  the  smooth 

transonic  flow  in  a curved  duct  and  has  an  exact  solution  to  which 
to  compare. 


A.  Solution  of  a Linear  Scalar  Problem 

The  solution  to  the  linear  scalar  problem 


9u  , 0 3u 

at  Z 3x 


0 


x e [-2,2] , t > 0 (20) 


u(x,0)  = exp(-(x  - xq)2/0.3)  x € [-2,2] 

u(-2 , t)  = exp(-(x  - t - xq)2/0.3)  t > 0 

can  be  used  to  examine  the  effects  of  varying  X in  the  spatial 

approximation  described  in  Eq.  (15).  The  time  integration  for  this  and  all 

following  examples  was  a fourth-order  Runge-Kutta  technique.  For  this  and  the 

next  model  problem  the  time  step  was  chosen  so  that  the  temporal  errors  were 

on  the  order  of  10  The  main  questions  to  be  answered  here  are  the  effect 

* 

of  the  X * 2 on  the  accuracy  of  the  solution  and  if  reflections  are  a 
problem  at  the  interface*  Figure  3 shows  the  computed  (circles)  and  exact 
(line)  solutions  for  the  pulse  after  it  has  propagated  through  the  interface 
at  x = 0 for  two  distributions  of  the  mesh  points  and  X =6. 
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The  interface  approximation  Eq.  (15)  degrades  the  accuracy  of  the  solution 

* 

when  compared  to  the  purely  characteristic  interface,  X = 2,  if  equal 
resolution  is  not  provided  in  each  subdomain.  In  no  case,  however,  is  the 
global  L2  error  larger  than  the  global  error  for  the  characteristic  inter- 

*Dg 

face.  Furthermore,  if  A remains  fixed  and  the  total  number  of  points  is 
increased,  the  error  decay  remains  spectral.  Figure  4 shows  the  pointwise 
errors  of  the  solution  to  Eq.  (20)  for  the  situations  represented  in  Figure  3 

•k 

as  A is  increased  beyond  the  characteristic  value  of  2.  The  situation  is 

worse  when  more  resolution  is  used  upstream  of  the  interface  because  the 

* 

approximation  includes  more  and  more  downwind  influence  as  X is 
increased.  In  a practical  computation,  the  effect  of  the  boundary 
approximation  would  not  be  important  if  the  solution  were  equally  resolved  in 
all  subdomains • 

Reflections  at  the  interface  are  not  visible  in  Figure  3 even  though  there 
is  a factor  of  two  difference  in  the  number  of  collocation  points,  Gottlieb 
and  Orszag  [11]  also  noticed  this  for  a tau  approximation  to  the  scalar  wave 
equation.  This  is  typical  for  the  spectral  approximations;  examples  with  up 
to  a factor  of  three  and  four  in  the  ratio  of  the  number  of  mesh  points  have 
not  shown  spurious  reflections  off  of  the  interface. 


B.  A Linear  System  Example 

The  accuracy  of  the  interface  approximation  will  now  be  demonstrated  with 
the  2x2  linear  system 


X 6 [-2,2], 


t > 0. 


(21) 
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The  coefficent  matrix  has  eigenvalues  +3  and  -1  so  the  system  has 
information  which  propagates  in  both  directions  and  with  different  speeds 
across  the  interface  at  x = 0.  The  initial  and  boundary  conditions  were 
chosen  so  that  the  characteristic  variables  were  the  Gaussian  pulses  used  in 
the  scalar  problem,  Eq.  (20)*  The  coefficient  X for  this  case  was  chosen 
to  be  the  maximum  eigenvalue,  X = +3.  Figure  5 shows  the  results  for  the  two 
components  of  this  system  at  a time  when  the  characteristic  pulses  have 
crossed  the  interface.  In  Figure  5a  there  are  twice  as  many  points  to  the 
left  of  the  interface  as  to  the  right  and  this  is  reversed  for  Figure  5b.  The 
symbols  represent  the  computed  solutions  and  the  solid  lines  represent  the 
exact  solutions. 

A study  of  discrete  errors  for  the  system  computations  is  shown  in 
Tables  I through  III.  Clearly,  the  error  is  spectral  for  all  three 
situations.  In  fact,  for  an  equal  number  of  mesh  points  on  either  side  of  the 
interface,  the  error  decay  is  exponential.  For  the  problem  of  propagating 
pulses,  where  the  features  needing  higher  resolution  are  continually  moving, 
it  is  not  surprising  that  the  best  errors  are  obtained  when  there  are  an  equal 
number  of  mesh  points  on  both  sides  of  the  interface. 


C.  Quasi-One-dimensional  Nozzle  Flow 

One  potential  point  of  concern  in  using  the  interface  approximation  given 
by  Eq.  (14)  regards  the  stability  of  cases  where  one  of  the  eigenvalues  of  the 
coefficient  matrix  is  much  larger  than  any  other.  Such  a situation  occurs  at 
sonic  points  in  an  ideal  gas  flow  where  one  of  the  characteristic  speeds 
actually  vanishes. 
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To  test  this  situation  the  nonlinear  problem  of  steady  gas  flow  in  a 
quasi-one-dimensional  converging-diverging  nozzle  was  solved  with  the 
multidomain  method  where  an  interface  was  placed  at  the  sonic  point.  The 
quasilinear  form  of  the  Euler  gas  dynamics  equations  for  time-dependent  flow 
in  a quasi-one-dimensional  nozzle  without  shocks  can  be  written  as 


■p“ 

+ 

u 

y 

“p* 
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u 

t 

2, 

a h 

u 

u 

- 

- 

- - 

X 

YuAx(x)/A(x) 


0 


(22) 


where  P is  the  logarithm  of  the  pressure,  u is  the  gas  velocity,  y is  the 
ratio  of  specific  heats,  and  a is  the  sound  speed.  The  coefficient  matrix 
has  eigenvalues  of  u + a and  u - a so  that  one  of  them  is  zero  at  a sonic 
point.  The  steady  flow  is  found  as  the  large  time  limit  of  the  unsteady  flow 
described  by  (22). 

The  nozzle  area  is  given  by  A(x)  = x/2  + 1/x  so  the  throat  occurs  at 
x = /2.  For  the  cases  run,  a subsonic  inflow  boundary  was  placed  at  x = 0.2 
and  characteristic  boundary  conditions  were  used.  After  the  gas  accelerates 
through  the  sonic  value  at  the  throat,  it  leaves  the  nozzle  supersonically  so 
no  boundary  conditions  are  applied  at  the  outflow. 

For  the  gas  dynamics  calculations  in  one  dimension,  X =V2(|u+a|  + |u-a|) 
was  chosen  since  this  corresponds  to  the  diagonal  elements  of  the  absolute 
value  of  the  coefficient  matrix.  Although  the  problem  was  solved  for  domain 
interfaces  in  both  the  subsonic  and  supersonic  portions  of  the  nozzle,  only 
results  for  a single  interface  at  the  sonic  point  will  be  shown  here.  (The 
two-dimensional  example  below  will  include  a variety  of  interface  placements.) 
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Figure  6 shows  the  steady  pressure  in  the  nozzle  computed  with  two  domains 
and  twice  as  many  mesh  points  on  the  right  as  on  the  left.  Our  tests  on  a 
variety  of  grids  have  not  shown  any  stability  difficulties  in  computing  steady 
flows  when  placing  the  interface  at  a sonic  point. 


D.  Two-Dimensional  Transonic  Flow 

A more  complicated  problem  is  the  two-dimensional  transonic  Ringleb 
flow.  This  problem  allows  us  to  study  the  computational  efficiency  of  the 
multidomain  solution  algorithm  as  outlined  in  the  Introduction.  Kopriva,  et 
al.,  [12]  used  this  problem  for  a comparison  of  the  performance  of  the 
spectral  method  with  a second— order  finite— difference  method.  In  this  section 
we  will  compare  the  multidomain  spectral  method  with  the  single  domain 
spectral  method. 

The  Ringleb  flow  is  a simple  example  of  a two-dimensional  transonic  flow 
for  which  there  is  an  exact  solution.  (See,  for  example,  Courant  and 
Friedrichs  [13].)  The  streamlines  of  the  physical  space  solution  appear  at 
large  distances  as  parabolas  which  are  determined  from  a special  hodograph 
solution  of  the  potential  equation  for  steady  irrotational  isentropic  flow. 
By  choosing  two  streamlines  to  represent  solid  walls,  this  problem  models  a 
steady  transonic  flow  in  a duct.  Figure  7 shows  the  Mach  contours  of  one  such 
duct  flow. 

Again  we  will  look  for  the  large  time  solution  of  the  unsteady  gas 
dynamics  equations,  this  time  in  two  dimensions.  The  problem  in  the  curved 
duct  shown  in  Figure  7 is  mapped  onto  a rectangle  in  the  stream  function- 
potential  coordinate  system  derived  from  the  exact  solution.  In  this 
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coordinate  system,  the  unsteady  equations  can  be  written  as 


Qt  = -R  (23) 

where  R is  the  steady  state  residual 

R = AQ+  BQ,.  (24) 

Since  the  solution  is  irrotational,  the  solution  vector  is  chosen  to  be 

Q = [P  u v]T  (25) 

and  the  coefficient  matrices  are 


As  before,  P represents  the  logarithm  of  the  pressure  and  (u,v)  represent 

the  velocity  components  in  the  Cartesian  x and  y directions, 

respectively.  The  matrix  coefficients  are  computed  from  the  mapping  derived 

from  the  exact  solution  and  the  contravariant  velocity  components  are 

U = u<t>  + v<f>  and  V = uil>  + vil)  , 
rx  Yy  rx  ry 
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The  physical  boundary  conditions  for  this  problem  represent  subsonic 
inflow  at  the  entrance  of  the  duct  (at  the  lower  left  of  Figure  7),  supersonic 
outflow  at  the  exit,  and  the  sides  are  treated  as  impermeable  boundaries 
(walls).  So  that  the  initial  boundary  value  problem  is  well-posed  the 

boundary  conditions  must  be  chosen  carefully.  See  Kopriva,  et  al.,  [12]  for 
details  of  the  procedure  which  follows.  For  the  subsonic  inflow,  we  can 
specify  only  two  quantities  and  have  chosen  the  total  enthalpy  and  the  angle 
of  the  flow  (so  V = 0) . The  quantities  P and  U are  computed  from  two 
conditions:  The  first  is  a compatibility  equation  derived  from  the  pressure 

equation  and  the  normal  momentum  equation.  The  second  comes  from 
differentiating  the  enthalpy  equation  in  time.  From  U and  the  condition 
V — 0,  the  Cartesian  velocities  u,  v can  be  computed.  At  the  outflow,  no 
boundary  conditions  are  needed.  Finally,  at  the  walls  the  normal  velocity,  U, 
must  vanish.  The  vector  Q is  computed  by  solving  the  tangential  momentum 
equation  for  V and  a compatibility  equation  which  combines  the  normal 

momentum  and  pressure  equations  for  P. 

The  system  of  equations  (22)  were  discretized  as  described  above,  and 
fourth-order  Runge-Kutta  was  used  for  the  time  integration.  For  a single 
domain,  the  Chebyshev  spectral  grid  for  the  Ringleb  problem  with  16  streamwise 
and  8 normal  mesh  intervals  is  shown  in  Figure  8.  It  is  clear  that  the 
spectral  method  strongly  concentrates  the  grid  points  near  the  walls.  The 
largest  gradients,  however,  occur  in  the  streamwise  direction  near  the  sonic 
line  (as  can  be  seen  in  Figs.  7 and  9)  where  the  streamwise  mesh  distribution 
is  coarsest.  These  two  factors  contribute  to  the  fact  that  the  time 
integration  step  is  very  small  and  that  accuracy  is  degraded  by  the  lack  of 
resolution  where  it  is  needed. 


136 


A multidomain  grid  distribution  for  which  performance  will  be  compared  to 
the  single  domain  method  is  shown  in  Figure  10,  Six  domains  now  cover  the 
duct  and  the  same  number  of  mesh  intervals  as  for  the  single  domain  case  are 
used.  The  divisions  were  chosen  to  demonstrate  the  kinds  of  situations  which 
the  multidomain  method  should  be  able  to  handle.  Three  divisions  with 
6 + 5+5  mesh  intervals  are  in  the  streamwise  direction  and  two  are  in  the 
normal  direction.  With  this  choice,  two  points  occur  where  the  corners  of 
four  domains  come  together.  The  first  domain  boundary  in  the  streamwise 
direction  was  chosen  to  appear  in  a subsonic  region  of  the  duct.  The  second 
domain  boundary  in  the  streamwise  direction  was  chosen  to  intersect  the  sonic 
line.  By  dividing  the  normal  direction  into  two  domains,  the  effective  mesh 
spacing  near  the  walls  is  doubled.  Finally,  note  that  by  comparing  Figure  10 
to  Figure  7 the  sonic  line  also  intersects  the  domain  interface  in  the  normal 
direction. 

To  allow  comparison,  Figure  11  shows  the  Mach  number  contours  for  both  the 
single  domain  and  the  multidomain  solutions.  Note  particularly  that  the  sonic 
line  remains  smooth  through  the  domain  interfaces.  Table  IV  summarizes  the 
performance  of  the  single  domain  spectral  method  compared  with  this  particular 
choice  of  grid.  First,  note  that  even  with  this  distribution  of  domains,  the 
maximum  error  in  the  pressure  for  the  multidomain  computation  has  not  been 
degraded  from  the  single  grid  one.  In  fact,  the  error  is  five  percent  better. 

The  real  advantage  that  the  splitting  has  had  for  this  case,  however,  is 
that  the  multidomain  solution  relaxes  more  quickly  to  steady  state  for  a given 
number  of  intervals  and  accuracy.  Figure  12  compares  the  rate  at  which  the 
discrete  norm  of  the  residual  of  the  pressure  decays  for  the  single  and 
multidomain  cases.  The  results  are  also  summarized  in  Table  IV.  From  the 
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trend  of  the  graph,  it  should  take  over  2 1/2  times  as  many  iterations  for  the 
single  grid  residual  to  decay  to  that  of  the  single  grid  residual.  This  is  a 
direct  result  of  the  fact  that  larger  time  steps  can  be  used  for  the  multi- 
domain case.  The  choice  of  X also  affects  the  convergence  rate:  larger 
values  up  to  the  stability  limit  give  faster  convergence  to  steady  state. 

The  advantage  of  a k-domain  derivative  computation  requiring  1/k  the 
amount  of  work  as  a single  domain  computation  does  not  show  up  in  this 
example.  In  fact,  as  Table  IV  indicates,  the  average  time  per  iteration  (time 
step)  requires  the  same  amount  of  time  at  0.5  sec.  on  the  Langley  Cyber  855. 
This  is  due  to  the  fact  that  there  is  overhead  in  computing  the  interface 
approximation.  Doubling  the  number  of  points  in  each  direction  with  the  same 
domain  distribution  decreases  the  time  per  iteration  for  the  multidomain 
computation  to  70%  of  the  single  domain  cost.  Though  no  attempt  was  made  to 
compute  the  interface  conditions  efficiently,  the  number  of  points  inside  each 
domain  will  have  to  be  large  compared  to  the  number  of  domains  for  the 
efficiency  gained  by  being  able  to  use  fewer  points  in  computing  derivatives 
to  become  important. 

The  final  advantage  of  a multidomain  method  which  was  listed  in  the 
Introduction  is  that  flexibility  in  the  choice  of  grid  point  distribution  is 
now  possible.  A series  of  calculations  were  made  with  the  duct  being  divided 
into  two  domain  intervals  in  each  direction.  As  with  Figure  10,  the  direction 
across  the  duct  was  divided  in  half  and  the  same  number  of  mesh  points  was 
used.  In  the  streamwise  direction,  however,  only  one  domain  boundary  was 
inserted.  This  boundary  was  inserted  in  several  places  along  the  duct  with 
different  numbers  of  points  on  either  side. 
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Results  of  some  of  the  computations  are  summarized  in  Table  V.  The 
division  is  reported  in  terms  of  the  fraction  of  the  total  variation  of  the 
velocity  potential  along  the  length  of  the  duct.  The  first  entry  in  the  list 
places  the  division  approximately  near  the  bend  of  the  duct  where  the 
gradients  of  the  solution  are  the  highest.  It  is  clear  that  with  a proper 
choice  of  grid  it  is  possible  to  obtain  better  accuracy  with  the  multidomain 
distribution  of  a given  number  of  grid  points  than  with  a single  grid.  For 
the  best  case  computed  here,  the  error  is  about  2 1/2  times  better  for  the 

multidomain  calculation. 

The  problem  of  how  to  properly  distribute  points  and  subdomains  in  general 
is  a major  one  and  is  beyond  the  scope  of  this  paper.  If  they  are  poorly 

placed  the  error  can  be  worse  than  the  single  domain  error  (see  Table  V),  For 

now,  it  is  not  known  how  to  obtain  the  optimal  point  and  subdomain 

distribution.  Rather,  some  knowledge  of  the  behavior  of  the  solution  must  be 
used  as  a guide. 


CONCLUSIONS 

We  have  described  a simple  approximation  which  allows  a multidomain 
spectral  solution  of  quasilinear  hyperbolic  equations.  Numerical  examples  of 
linear  equation  models  and  ideal  gas  flow  show  that  the  method  gives 
advantages  in  both  accuracy  and  efficiency  over  using  a single  domain. 
Dividing  up  a computational  domain  into  several  subdomains  gives  the 
possibility  of  local  refinement  and  allows  some  flexibility  in  the 
distribution  of  mesh  points.  It  is  possible  to  obtain  better  accuracy  by 
doing  so.  Also,  with  multiple  domains  it  is  possible  to  take  larger  time 
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steps  than  with  a single  domain.  This  increases  the  efficiency  for  using  time 
relaxation  to  acheive  steady  state  solutions. 

The  use  of  a multidomain  technique  is  also  appropriate  if  discontinuities 
are  fitted  as  boundaries.  When  shocks  occur  within  a flow,  subdomains  would 
be  arranged  so  that  each  shock  lies  on  a subdomain  boundary.  In  smooth  parts 
of  the  solution,  the  technique  described  here  would  be  used.  Along  shock 
interfaces,  a shock  fitting  algorithm  like  that  described  in  reference  [1]  can 
be  used  (Kopriva  and  Hussaini,  to  be  published). 

The  theoretical  issues  which  remain  are  many.  Some  theory  for  the  range 
* 

of  values  which  X can  take  for  the  method  to  be  stable  must  be  found. 

* 

However,  choosing  X to  be  the  average  of  the  largest  and  smallest 
eigenvalues  of  the  coefficient  matrix  has  always  worked.  Finally,  like  the 
problems  associated  with  the  p-  version  of  the  finite-element  method,  the 
choice  of  domain  and  point  distribution  for  a given  number  of  points  is  an 
open  issue. 
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TABLE  I 


l>2  errors  for  the  solutions  to  Eq.  (20)  with  equal 
number  of  points  on  each  side  of  the  interface. 


N 

Error  in  u 

Error  in  v 

8 

1.57  x 10-2 

1.49  x 10-2 

16 

4.15  x 10“6 

4.86  x 10-6 

32 

1.91  x 10”9 

1.91  x 10-9 

TABLE  II.  L2  errors  for  the  solutions  to  Eq.  (20)  with  more 
points  to  the  right  of  the  interface. 


NL’NR 

Error  in  u 

Error  in  v 

8,  16 

1.22  x 10"2 

1.05  x 10~2 

12,  24 

2.45  x 10-4 

2.33  x 10-4 

16,  32 

3.93  x 10-6 

3.93  x 10"6 

TABLE  III.  L2  errors  for  the  solutions  to  Eq.  (20)  with 
more  points  to  the  left  of  the  interface. 


NL’NR 

Error  in  u 

Error  in  v 

16,  8 

9.80  x 10"3 

1.04  x 10-2 

24,  12 

1 

o 

iH 

X 

00 

• 

cn 

2.88  x 10-4 

32,  16 

1.49  x 10“6 

2.30  x 10-6 
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TABLE  IV.  Performance  comparison  for  single  and  multidomain  spectral 


computations. 

Grids:  Single  Domain  (SD)  17  x 9 points 

Multidomain  (MD)  (7  + 6 + 6)  x (5  + 5)  points 
(separated  by  domain) 


Maximum  Error 
SD 
MD 


1.85  x 10 
1.74  x 10 


Number  of  Steps  to  Reduce  Residual  Three  Orders  of  Magnitude 
SD  > 1500 

MD  780 


Average  Spectral  Radius 

SD  0.9964 

MD  0.9942 


Average  Time  per  Iteration 

SD  0.50  sec. 

MD  0.50  sec. 


TABLE  V.  Effect  of  streamwise  mesh  distribution 
on  Ringleb  calculation. 


Grid 

Division 

Maximum  Error 

8 + 8 

0.45  + 0.55 

7.8  x 10"4 

8 + 8 

0.50  + 0.50 

9.3  x 10”4 

16(SD) 

— 

1.9  x 10"3 

10  + 6 

0.34  + 0.66 

1.2  x 10-2 
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FIG.  1.  Diagram  of  the  two-dimensional  subdomain  structure  used  to  divide 


computational  domain. 


-2-10  1 2 

X 


FIG#  3a*  Solution  of  the  scalar  pulse  problem  Eq.  (19)  computed  on  two 
domains  shown  after  the  pulse  has  travelled  from  the  left  through 
the  interface  at  x = 0*  Computations  are  for  22  points  left  and  11 
points  right-  of  the  interface.  The  exact  solution  is  the  solid 
line;  computed  solutions  are  the  circles. 
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0 1 2 

X 

Solution  of  the  scalar  pulse  problem  Eq.  (19)  computed  on  two 
domains  shown  after  the  pulse  has  travelled  from  the  left  through 
the  interface  at  x = 0.  Computations  are  for  11  points  left  and  22 
points  right.  The  exact  solution  is  the  solid  line;  computed 
solutions  are  the  circles. 
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A 

FIG.  4a.  Pointwise  errors  as  X varies  for  the  situation  in  Figure  3a. 


153 
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FIG.  5a.  Graph  of  the  two  solutions  u (circles)  and  v (squares)  of  the 
linear  system  Eq.  (20)  with  22  points  on  the  left  and  11  points  on 
the  right.  The  exact  solutions  are  represented  by  the  solid  line. 
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X 

FIG.  5b.  Graph  of  the  two  solutions  u (circles)  and  v (squares)  of  the 
linear  system  Eq.  (20)  with  11  and  22  points  on  the  left  and  the 
right.  The  exact  solutions  are  represented  by  the  solid  line. 
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FIG.  6.  Plot  of  the  computed  pressure  in  a converging-diverging  nozzle  where 
the  interface  is  placed  at  the  sonic  point  at  x = /2.  Twice  as 
many  points  are  used  on  the  right  as  on  the  left  of  the  interface. 
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FIG.  8.  Single  domain  Chebyshev  grid  for  the  Ringleb  problem. 
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FIG.  10.  Multidomain  grid  with  six  subdomains  for  the  Ringleb  problem. 
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FIG.  12.  Comparison  of  residual  decay  for  single  domain  and  multidomain 
solutions  to  the  Ringleb  problem. 
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ON  SUBSTRUCTURING  ALGORITHMS  AND  SOLUTION  TECHNIQUES 
FOR  THE  NUMERICAL  APPROXIMATION  OF  PARTIAL  DIFFERENTIAL  EQUATIONS 

M.  D.  Gunzburger 
Carnegie-Mellon  University 


R.  A.  Nicolaides 
Carnegie-Mellon  University 


ABSTRACT 

Substructuring  methods  are  in  common  use  in  structural  mechanics  problems 
where  typically  the  associated  linear  systems  of  algebraic  equations  are 
positive  definite.  Here  these  methods  are  extended  to  problems  which  lead  to 
nonpositive  definite,  nonsymmetric  matrices.  The  extension  is  based  on  an 
algorithm  which  carries  out  the  block  Gauss  elimination  procedure  without  the 
need  for  interchanges  even  when  a pivot  matrix  is  singular.  Examples  are 
provided  wherein  the  method  is  used  in  connection  with  finite  element 
solutions  of  the  stationary  Stokes  equations  and  the  Helmholtz  equation,  and 
dual  methods  for  second-order  elliptic  equations. 
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1.  THE  SUBSTRUCTURING  ALGORITHM  IN  THE  POSITIVE  DEFINITE  CASE 

The  use  of  substructuring  techniques  in  the  numerical  solution  of  problems 
governed  by  positive  definite  partial  differential  equations  is  in  widespread 
use.  The  most  notable  case  is  found  in  structural  mechanics,  especially  in 
connection  with  the  equations  of  linear  elasticity.  For  the  sake  of 
simplicity,  here  we  describe  the  technique  for  the  Dirichlet  problem  for  the 
Poisson  equation.  Specifically,  suppose  we  want  to  solve 


-Au  = f in  ft 
u = 0 on  3ft 


(1) 


where  ft  is,  say,  an  open  bounded  region  in  with  boundary  3ft.  We 

subdivide  the  region  ft  into  open  subregions  ft^  i = l,*»*,m,  such  that 
m 

ft  = U if  and  ft  Oft.  = 0 for  i * j . We  denote  by  r . , 1 < i < i < m 
1=1  1 1 J ij  - - 

the  interfaces  between  regions  ft.^  and  ft^ , i.e.,  r^j  - T^fHftj . Of  course, 
for  particular  choices  of  i and  j in  a given  subdivision,  may  be 

empty.  A sketch  of  a particular  example  with  m = 5 is  given  in  Figure  1. 
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We  also  subdivide  ft  into  a finite  difference  or  finite  element  grid 

which  in  practice  is  much  finer  than  the  above  subdivision  of  ft  into  m 
subregions.  We  choose  the  two  subdivisions  so  that  the  interfaces  T ^ 

coincide  with  edges  of  the  finite  difference  or  finite  element  cells.  The 

discretization  of  (1)  proceeds  in  the  usual  manner.  The  essence  of  the 
substructuring  algorithm  is  found  in  the  particular  choice  for  the  ordering  of 
the  unknowns  and  equations,  i.e.,  columns  and  rows,  in  the  linear  system 
resulting  from  the  discretization  of  (1).  Specifically,  all  unknowns  and 

equations  associated  with  the  interior  of  a substructure  ft^  are  numbered 

sequentially,  one  substructure  at  a time,  and  unknowns  and  equations 
associated  with  the  interfaces  are  grouped  together  and  numbered  last. 

For  example,  in  a typical  finite  difference  discretization  of  (1),  one 
associates  equations  and  unknowns  with  nodes  in  the  grid.  In  this  case,  we 

would  group  together  all  the  unknowns  in  subregion  ft^  together  and  number 

them  first,  then  proceed  to  ft^,  etc.,  and  finally  to  ft  . Then  we  would 

number  all  the  unknowns  along  the  interfaces  T . . 1 < i,  j m.  The 

equations  would  be  numbered  in  the  same  way.*-  Likewise,  in  a finite  element 
discretization  of  (1),  some  unknowns  (trial  functions)  and  equations  (test 
functions)  are  associated  with  nodes  or  edges  and  these  are 


^The  subdivision  and  numbering  method  described  here  applies  to  difference 
methods  with  stencils  involving  only  nearest  neighbors.  The  method  may  be 
extended  in  an  obvious  manner,  e.g.,  by  defining  the  interfaces  to  be  more 
than  one  grid  point  in  thickness,  to  methods  having  stencils  with  a greater 
degree  of  connectivity. 


167 


numbered  in  the  same  manner  as  in  the  finite  difference  case  above In 
addition  there  may  be  test  and  trial  functions  more  naturally  associated  with 
the  finite  elements  themselves,  and  the  equations  and  unknowns  associated  with 
these  functions  are  grouped  together  with  the  other  ones  associated  with  the 
interior  of  the  corresponding  subregion 

The  net  result  of  the  above  numbering  schemes  is  that  the  linear  system 
resulting  from  the  discretization  of  (1)  has  the  form 


In  (2),  the  matrices  , i = l,«*»,m,  in  the  finite  element  case,  result  in 

the  case  of  both  the  test  and  trial  functions  being  associated  with  the 

interior  of  the  subregion  il  9 i = l,***,m,  respectively,  while  the  matrix 

Aq  results  from  test  and  trial  functions  associated  with  the  interfaces  T. . , 

ij 

1 1 < i m.  The  matrices  C^,  and  B^,  represent  trial,  respectively  test, 


2 

Again,  the  method  described  here  applies  to  the  case  where  the  test  and  trial 
functions  vanish  outside  the  elements  which  contain  the  associated  node  or 
edge.  However,  by  defining  the  interfaces  to  be  one  or  more  elements  thick, 
the  method  may  be  easily  extended  to  other  cases,  e.g,,  cubic  B-spline  test 
and  trial  functions. 
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functions  associated  with  the  interior  of  and  test,  respectively  trial, 

functions  associated  with  the  interfaces.  The  vectors  Ui,  i = l,***,m, 

respectively  denote  the  unknowns  associated  with  the  interior  of 
i = l,*«»,m,  while  Uq  denotes  the  unknowns  associated  with  the  interfaces. 
All  of  these  associations  can  also  be  made  in  the  finite  difference  case. 

It  is  well-known  that  the  coefficient  matrix  of  the  linear  system  (2), 

resulting  from  a discretization  of  (1),  is  symmetric  and  positive  definite. 

XT  x 

Indeed,  A^  = A^,  for  i = l,«*»,m  and  Aq  = Aq.  It  is  also  easy  to 

see  that  the  matrices  A^,  i = l,**«,m,  are  themselves  positive  definite.  In 

fact,  these  matrices  are  exactly  the  ones  which  would  result  from  the 

analogous  discretization  of  the  problems 

Au  = f in 

(3) 

u = 0 on  3ft^ 

for  i « l,***,m,  where  3ft^  denotes  the  boundary  of  Note  that  this 

boundary  may  consist  of  both  interfaces  and  a portion  of  the  boundary  3Q  of 
Q,  as  is  the  case  for  and  ^5  Figure  1,  or  may  consist  wholly 

of  interfaces  as  is  the  case  for  in  that  figure.  Discretization  of  (3) 

results  in  a linear  system  with  a coefficient  matrix  A^,  and  thus  A^  is 
clearly  symmetric  and  positive  definite.  We  note  that  even  in  the  case  of  the 
Neumann  problem,  i.e. , the  boundary  condition  in  (1)  is  replaced  by  3u/3n  **  0 
on  3Q,  the  matrices  A^  in  (2)  would  still  be,  at  least  in  the  finite 

o 

element  case,  symmetric  and  positive  definite.  This  is  so  because  the 

problem  (3)  associated  with  the  matrix  A^  is  now  given  by 
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Au  = f in 


— = o on  an.nan 
9n  i 


(4) 


u = o on  , j = l,**»,m, 


where  we  have  set  Since  is  never  empty,  the  matrix 
associated  with  (4)  is  symmetric  and  positive  definite.^ 

With  the  matrices  A^,  i = l,»»»,m,  being  positive  definite,  one  may 
proceed  to  solve  (2)  by  a block  elimination  procedure.  Symbolically,  we  may 
express  the  first  m stages  of  this  procedure  by  the  relations 


Ui  = Ai1(Fi  " Bi  V’  1 " <4 5> 

which  uniquely  express  in  terms  of  data  and  the  interface  unknowns  Uq. 

The  last  stage  of  the  process  requires  the  solution  of  the  linear  system 


where 


DUq  - G 


D " Ao  - j,  ci  k'i  \ and  G " Fo  ’ jj  ci  ‘I1  Fi 


(6) 

(7) 


^If  on  3^0 3J2  something  other  than  Dirichlet  data  is  specified,  then  the 
matrix  A^  also  contains  rows  and  columns  associated  with  test  and  trial 
functions  associated  with  nodes  or  edges  on  that  portion  of  the  boundary. 

40f  course,  the  fact  that  Ap  i = 0,...,m,  are  positive  definite  may  be 

deduced  directly  from  the  fact  the  coefficient  matrix  of  (2)  is  positive 
definite,  i.e.,  the  former  is  a necessary  condition  for  the  latter. 


170 


Of  course,  in  (5)  and  (7)  the  inverses  are  not  explicitly  computed,  but  rather 
appropriate  linear  systems  are  solved.  The  solvability  of  the  system  (6) 
follows  whenever  the  system  (2)  is  solvable.  In  fact,  if  the  system  (2)  is 
positive  definite,  so  is  the  matrix  D [1],  Once  (6)  is  solved  for  Uq,  (5) 
yields  U^,  I = ].,•••, m. 

Although  we  have  described  the  substructuring  algorithm  in  the  context  of 
the  Poisson  equation,  the  method  can  be  applied  in  a similar  manner  to  any 
positive  definite  problem.  As  noted  above,  the  method  has  encountered  great 
success  in  structural  mechanics  problems.  However,  in  other  fields  where  the 
governing  equations  are  not  positive  definite  or  symmetric  one  may  still  order 
the  equations  and  unknowns  to  produce  linear  systems  such  as  (2),  but  these 
may  not  always  be  solved  by  a standard  block  elimination  procedure.  In  the 
next  two  sections  we  describe  a procedure  to  solve  (2)  even  in  the  case  of  the 
matrices  A^  being  singular  and  show  how  the  method  may  be  implemented 
through  an  elimination  procedure.  In  Section  4 we  describe  examples  which 
lead  to  singular  matrices  A^.  Finally,  in  Section  5 we  give  some  concluding 
remarks. 

Incidentally,  in  almost  all  situations  the  use  of  a properly  implemented 

substructuring  algorithm  will  result  in  savings  in  computational  costs  when 

compared  to  a banded  elimination  procedure.  For  example,  consider  a 

discretization  of  Poisson^s  equation  on  a unit  square.  Suppose  we  have  M 

subregions  in  each  direction  so  that  ra  = and  suppose  that  each  subregion 

is  further  subdivided* by  introducing  an  n x n grid.  Thus,  there  are  a total 

4 4 

of  Mm  points  in  each  direction.  Banded  elimination  requires  0(M  n ) 

operations,  while  the  above  substructuring  algorithm  can  be  implemented  in,  at 
4 4 3 

most,  0(Mn  + M n ) operations.  We  note  that  this  particular  problem  is  not 
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particularly  well-suited  for  substructuring  methods.  Also,  the  relative 
advantage  of  substructuring  is  greater  when  one  considers  three-dimensional 
problems  or  systems  of  partial  differential  equations. 

We  also  note  that  substructuring  ideas  in  connection  with  preconditioning 
techniques  have  been  discussed  in  [2]. 


2.  THE  SOLUTION  ALGORITHM  IN  THE  GENERAL  CASE 

We  begin  by  describing  a method  for  solving  (2)  in  the  case  where  the 
matrices  are  singular.  The  algorithm  described  here  is  a special  case  of 

a more  general  algorithm  which  applies  to  arbitrary  matrices  with  arbitrary 
subdivisions  into  blocks,  e.g.,  the  matrix  has  no  special  structure  and  the 
matrices  A^  may  not  only  be  singular,  but  may  even  be  rectangular.  The  more 
general  algorithm  is  described  in  [3].  We  will  describe  the  algorithm  as 
applied  to  (2)  and  we  will  make  use  of  pseudo-inverses  in  order  to  simplify 
the  initial  presentation.  However,  we  emphasize  that  the  algorithm  may  be 
implemented  without  the  need  for  the  explicit  calculation  of  any  pseudo- 
inverses; such  an  implementation  is  discussed  in  the  next  section.  This  is 
similar  to  the  observation  that  the  algorithm  contained  in  ( 5 ) — ( 7 ) may  be 
implemented  without  explicitly  computing  any  inverses,  e.g.,  by  solving  linear 
systems. 

The  system  (2)  is  equivalent  to 

Ai  Ui  + Bi  U0  = V 1 = (8) 

j,  ci  ui  + *0  uo  - F0-  <«> 
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Now,  may  be  orthogonally  decomposed  in  the  form 


ui  = Yi  + Z±,  1 = 1, •••,»,  (10) 

where 

Ai  Zi  = 0,  i = l,»**,m,  (11) 

and  Y^  is  orthogonal  to  all  vectors  satisfying  (11).  In  particular, 

YY  Z±  = 0,  i = 1 , • • • ,m.  (12) 

Substitution  of  ( 10)— ( 11)  into  (8)  yields  that 


Ai  Yi  - Fi  - Bi  V 


i = 1,. 


y TO  • 


(13) 


Since  Y^  is  orthogonal  to  the  null  space  of  A^,  (13)  yields  that 

Yi  “ Ai(Fi  " Bi  V*  i = !,••*, m,  (14) 


where  denotes  the  pseudo-inverse  of  This  relation  states  that 

is  uniquely  determined  from  the  data  and  Uq.  Note  that  (8)  yields  no 
information  concerning  as  is  to  be  expected  since  A^  - 0, 

Substituting  (10)  and  (14)  into  (9)  yields  that 


m 

DU0  - G - I c z 

i=l 

where 


(15) 
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(16) 


D " 40  - j,  Ci  A1  B1  *nd  G * F0  - j,  C1  k+i  F 


i=l 


i=l 


We  may  also  decompose  Uq  in  the  form 


where 


U0  = 


Yo  + zo 


DZ0  " 0 


(17) 


(18) 


and  Yq  is  orthogonal  to  all  vectors  satisfying  (18).  In  particular, 


TJ  zo  ■ °- 


(19) 


Substitution  of  (17)— (18)  into  (15)  yields  that 

m 

DY  = G - l C Z (20) 

i-1 

and,  since  Yq  is  orthogonal  to  the  null  space  of  D,  (20)  yields  that 

Yq  - D+(g  - l C Z ).  (21) 

i-1 

Again,  it  is  not  surprising  that  (15)  yields  no  information  concerning  Zq. 
Substitution  of  (17)  and  (21)  into  (14)  then  yields  that 


[F  - B D+(G  - l C 
j=l  J 


Zj)]- 


Bi  Z0 


(22) 


for  i = 1 , • • • ,m. 
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At  this  point  we  have  shown  that  Y^,  i = 0,«**,m,  may  be  uniquely 
expressed  in  terms  of  Z^>  i = by  (21)  and  (22).  It  remains  to  show 

how  to  find  the  latter.  The  first  step  is  to  multiply  (13)  by  (i  - A*)  • 

Since  A^  A*  A^  = A^,  , we  have  that 


(I  - A±  A^)(Fi  - B±  UQ)  =0,  i = 1 , • • • ,m. 


or  substituting  (17)  and  (21), 


+ + m 

(I  - k±  Ai)[F1  - B±  ZQ  - Bi  D (G  - ^ Cj  Z ^ ) ] =0,  i = l,...,m 


(23) 


Now  suppose  we  are  able  to  determine  bases  for  the  null  spaces  of  A^, 
i = l,»*«,m,  and  D.  We  collect  each  of  these  basis  sets  into  matrices  N^, 
i = 0 , • • • ,m,  i.e.,  N^,  i = 0,»«»,m,  have  linearly  independent  columns. 


DNq  = 0 and  A±  N = 0,  i-l,»«*,m,  (24) 

and  the  columns  of  Nq,  respectively  N^,  span  the  null  space  of  D, 
respectively  A^,  i = l,»»»,m.  The  number  of  columns  in  is,  of  course, 
the  dimension  of  the  corresponding  null  spaces.  Now,  we  may  write  that 


zi  = Ni  V 1 = <25> 


for  some  vectors  A^.  Substituting  (25)  into  (23)  then  yields  that 


^ R±j  Aj  = H±,  i = I,**., m,  (26) 
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where 


and 


RiO  " (I  " Ai  At)Bi  N0»  Hi  = (I  " Ai  Al)tFi  " Bi  D+  G1 
Rij  = C1  " A±  Ai)Bi  D+  Cj  Nj ’ j = l,***,m. 


(27) 


(26)  may  be  expressed  in  the  form 


RA  - H. 


(29) 


In  general,  R is  a rectangular  matrix.  The  number  of  rows  in  R is  equal  to 
the  sum  of  the  number  of  rows  of  the  matrices  A^,  i = l,***,ra,  and  the  number 
of  columns  of  R is  equal  to  the  sum  of  the  dimensions  of  the  null  spaces 
of  A^,  i = l,»»«,m;  and  D.  It  can  be  shown  [3]  that  the  system  (29)  is  a 
consistent  system,  and  we  may  find  its  solution,  for  example,  by  forming 

(RTR)A  = RTH.  (30) 

Suppose  we  can  solve  (30)  for  A.  Then  (28)  yields  A^,  i = 0,«»»,m,  (25) 
then  yields  Z^,  i = 0,»*»,m,  (21)  and  (22)  yields  Y^,  i = 0,»»»,m,  and 
finally  (10)  and  (17)  yield  the  solution  U^,  i = 0,»*»,m,  of  (2). 


176 


The  algorithm  described  here  is  related  in  the  following  manner  to  the 
block  elimination  algorithm  in  Section  1.  Suppose  that  the  matrix  of  (2)  and 
all  the  A^s  and  D are  nonsingular.  Then,  the  algorithm  of  this  section 
reduces  to  the  standard  block  Gauss  elimination  procedure.  Indeed,  in  this 
case,  = A^ , D+  = D ^ and  = 0 so  that  and  the  latter  are 

determined  uniquely  by  (14)  and  (21).  Note  the  correspondence,  in  this  case, 
between  ( 14 )— ( 15)  and  (5)-(6). 

In  the  more  general  case,  i.e.,  some  or  all  of  the  A^s  and  D being 
singular,  it  can  be  shown  [3]  that  the  rank  deficiency  of  (30)  is  exactly  that 
of  the  original  coefficient  matrix  in  (2).  Therefore,  if  the  latter  is 
nonsingular,  then  so  is  RTR  and  then  A in  (30)  is  uniquely  determined. 
Since  the  Z^s  and  Y^'s  are  uniquely  determined  from  A,  the  algorithm 
produces  the  unique  solution  of  (2).  If  the  matrix  of  (2)  is  singular,  so 
is  R R and  (30)  does  not  have  a unique  solution.  However,  (30)  may  be 
solved  anyway,  either  in  terms  of  arbitrary  parameters  or  by  adding 
constraints.  The  number  of  parameters  or  constraints  is  equal  to  the 
dimension  of  the  null  space  of  R^R  which  in  turn  is  the  same  as  the 
dimension  of  the  null  space  of  the  coefficient  matrix  in  (2).  In  any  case, 
once  a particular  A is  determined,  then  and  are  also  determined. 

In  particular  applications  to  the  solution  of  partial  differential 

equations,  the  dimension  of  the  system  (30)  is  small  compared  to  that  of  the 

system  (2).  Indeed,  typically  dim(R  R)  = 0(m) , the  number  of  subregions. 

For  example,  the  dimension  of  the  null  spaces  of  the  matrices  A^  and  D may 

T 

be  one  or  zero,  in  which  case  dim(R  R)  < m + 1. 
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3.  AN  ELIMINATION  IMPLEMENTATION 

We  begin  by  restating  the  algorithm  of  the  previous  section.  Given  the 
matrices  AQ,»»»,Am>  Bj,»«»,Bm>  Cj,»«»,Cm  and  the  vectors  we  find 

vectors  ^o’***’^m  satisfyinS  (2)  by  the  following  procedure. 

1.  Compute  A*  F^  and  A*  B^  for  i = l,»»»,m. 

2.  Compute  , i = l,«»»,m,  whose  columns  constitute  a basis  for  the  null 
space  of  A^,  i = l,«»«,m,  respectively. 

3.  Compute  C^(A*  B^) , C^(A*  F^)  and  C^  for  i = l,«»»,m. 

m m 

4.  Compute  D = A^  - £ Ci(Ai  Bi)  and  G = fq  ~ ^ Ci^Ai  Ff)* 

i=l  i=l 

5.  Compute  D+  G. 

6.  Compute  Nq  whose  columns  constitute  a basis  for  the  null  space  of  D. 

7.  Compute  D+  C^  for  i = l,»»»,m. 

8.  Compute  the  matrices 

Ri0  “ Bi  N0  " Ai  ^Ai  Bi^N0  for  1 " 

Rij  = Bi^D+  Cj  Nj)  ' VAi  Bi^D+  Cj  V f°r  i>j  = 1’***»m 

and  the  vectors 

Hi  » Fi  - Bi(D+  G)  - A^A*  F)  + A±(A*  Bi)(D+  G)  for  i ■ l,»»»,m. 

9.  Assemble  the  results  of  step  6 into  the  matrix  R and  vector  H 

T T 

according  to  (28)  and  then  compute  R R and  R H. 

T T 

10.  Solve  the  linear  system  R RA  = R H for  A and  then  compute  A^ 
i = 0,»««,m,  according  to  the  partition  of  (28). 

11.  Compute  A^  for  i = 0,»»*,m. 
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+ m . m 

12.  Compute  Y = (D  G)  - £ C N A » (D  G)  - T C,  2,. 

1=1  1 1=1  1 1 

13.  Compute  Uq  = Yq  + Zq. 

14.  Compute  Y±  = (A+  F±)  - (A+  Bi)UQ  for  i - lf...,m. 

15.  Compute  = Y±  + Z±  for  i = l,...,m. 

Other  than  steps  1,  2,  5,  6,  and  10,  the  above  algorithm  requries  only  matrix 
and  matrix-vector  multiplications.  In  this  section  we  show  how  to  carry  out 
the  other  operations  required  by  the  algorithm  through  an  elimination 
procedure.  In  particular,  we  will  not  need  to  explicitly  calculate  any 
pseudo-inverses  of  matrices. 

We  first  describe  how  to  carry  out  steps  1 and  2.  Consider  the  linear 
system. 

At  Q = S=  (Bi,Fi,0)  (31) 

where  the  right-hand  side  matrix  S consists  of  the  matrix  B^,  the  vector 
Fi>  and  some  additional  columns  of  zeroes.  The  number  of  these  additional 
columns  should  be  greater  or  equal  to  the  dimension  of  the  null  space  of 
Ai*  This  dimension  will  actually  be  determined  during  the  elimination 

procedure.5  We  now  proceed  to  solve  (31)  by  Gauss  elimination  with  partial 
pivoting.  If  the  matrix  A^  is  singular,  then  one  or  more  times  during  the 
elimination  procedure  we  will  not  be  able  to  locate  a nonzero  pivot  element. 
In  fact,  the  number  of  times  this  occurs  is  exactly  the  dimension  of  the  null 
space  of  A^.  However,  at  such  an  occurrence,  the  corresponding  column  is 


See  Section  5 concerning  the  effects  that  roundoff  errors  may  have  on  the 
determination  of  this  dimension. 
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already  in  the  eliminated  form  so  that  we  may  skip  over  to  the  next  column  and 
continue  the  elimination  process.  At  the  end  of  the  process,  (31)  has  been 
reduced  to  the  form 

A Q = J = (B  ,F  0)  (32) 

where  A^  is  upper  triangular  and  in  row  echelon  form.  When  A^  is 

singular,  A^  will  have  zeros  at  the  pivot  location  for  exactly  those  columns 
for  which  no  nonvanishing  pivot  element  was  found. 

We  now  proceed  to  backsolve  (32).  No  difficulty  is  encountered  until  a 

row  is  reached  for  which  the  pivot  entry  of  A^  is  zero.  For  the  columns 

of  Q corresponding  to  and  F^,  we  may  arbitrarily  set  (to  something 

other  than  zero)  the  entry  in  the  row  corresponding  to  the  zero  pivot  of  A^ 
Then  the  backsolve  procedure  may  continue  until  we  reach  another  zero  pivot 
entry,  at  which  time  we  again  arbitrarily  specify  an  entry  in  the  columns  of 

Q corresponding  to  the  columns  B^  and  F^  of  S.  While  all  this  is  going 

on  we  are  also  solving  (32)  for  the  columns  corresponding  to  the  zero  columns 
of  S.  For  these  columns,  whenever  a zero  pivot  entry  is  encountered  in  A^ , 
one  of  the  elements  in  the  corresponding  row  is  set  to  one  while  the  rest  are 

set  to  zero.  Each  time  a zero  pivot  entry  is  encountered,  a different  column 

is  chosen  for  which  one  sets  the  arbitrary  element  to  one.  At  the  end  of  this 
backsolve  procedure,  (32)  yields  that 

Q = (L,K,N±). 

A 

Here  the  columns  of  form  a basis  for  the  null  space  of  Aj_  and  L and 

A 

K are  particular  solutions  of  the  systems. 
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A±  L = B1  and  K = F . 


(33) 


A A 

The  final  step  is  to  orthogonalize  the  columns  of  L and  K with  respect 
to  the  columns  of  to  yield 

Q = (l.k.nJ. 

Since  A^N^  = 0,  L and  K are  still  solutions  of  (33).  Moreover,  the 

columns  of  L and  K are  orthogonal  to  the  null  space  of  Aj^  and, 

therefore,  are  minimum  norm  solutions.  By  the  uniqueness  of  the  minimum  norm 
solution,  we  have  that 

L = A*  B±  and  K = A*  F^,. 

Thus  the  above  elimination  procedure  has  accomplished  the  tasks  of  steps  1 and 
2 of  the  algorithm* 

The  tasks  of  steps  5,  6,  and  7 can  be  accomplished  in  an  analogous 

manner.  Also,  if  the  matrix  RTR  is  nonsingular,  then  it  may  be  easily 

solved  by  an  ordinary  Gauss  elimination  procedure.  If  it  is  singular  then  a 
solution  in  terms  of  arbitrary  parameters  may  be  determined  in  a manner 
similar  to  the  above  procedure  for  the  system  (31).  We  note  that  any  sparsity 
or  structure  inherent  in  the  matrices  A±  may  be  exploited  in  the  above 
procedure.  However,  in  general,  the  matrix  D will  be  dense.  We  will  return 
to  this  point  in  the  concluding  section. 
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4.  EXAMPLES 


The  Stationary  Stokes  Equation 

Consider  the  stationary  Stokes  equations  for  the  slow  flow  of  a viscous 
fluid  in  a bounded  region  in  l£.  These  are  given  by 

Aii  - grad  p = £ in  ft 

div  u = 0 in  ft  (34) 

_u  = 0 on  3ft. 

Here  ii  denotes  the  velocity,  p the  pressure,  f^  the  given  body  force  and 
the  viscosity  coefficient  has  been  absorbed  into  p and  f..  Clearly,  the 

pressure  cannot  be  determined  uniquely  since  we  may  add  an  arbitrary  constant 
to  the  pressure  and  still  satisfy  (34). 

A finite  element  approximation  of  the  solution  (u,p)  of  (34)  may  be 
defined  as  follows.  Given  finite-dimensional  spaces  Vn  and  Sn  for  the 

discrete  velocity  and  pressure  fields,  we  seek  u^ey*1  and  p^€S^  such  that 

/(grad  uh : grad  - p*1  div  ^h)dft  = -/  _f*v^  dft  for  all 
ft  ft 

(35) 

/ q1  div  u1  dft  = 0 for  all  q^€S^. 
ft 

Here  we  assume  that  the  elements  of  V*1  satisfy  the  boundary  condition  in 
(34).  By  choosing  bases  for  the  spaces  V*1  and  S*1,  (35)  can  be  expressed  as 
a linear  algebraic  system  for  the  coefficients  in  the  basis  function 
expansions  of  u*1  and  p*1. 
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Now  it  is  well-known  that  arbitrary  choices  of  spaces  V*1  and  S*1  may 
not  yield  stable  or  accurate  solutions.  However,  there  are  now  known  many 
element  pairs  for  which  (35)  yields  optimally  accurate  solutions  [4],  [5], 

[6].  One  such  pair  is  described  as  follows.  Suppose  denotes  a 

triangulation  of  the  region  ft.  We  denote  by  a finer  triangulation 

derived  from  by  subdividing  each  triangle  in  into  four  congruent 

triangles  by  joining  thp  midsides.  See  Figure  2.  We  define  S*1  to  consist 
of  piecewise  constant  functions  over  the  triangulation 


Figure  2.  A triangle  in  and  the  corresponding  triangles  in  V^. 

■u 

and  Vn  to  consist  of  piecewise  linear  functions  over  the  triangulation  V, 

h 

which  are  continuous  over  ft  and  vanish  on  9ft.  This  combination  is  known  to 

be  stable  and  be  optimally  accurate  [6].^  The  basis  functions  for  are 

easily  associated  with  the  vertices  of  the  triangulation  V,  while  the  basis 

n 

functions  for  S*1  are  associated  with  the  triangles  in  the  triangulation  S^. 


^See  below  for  the  necessary  restriction  on  the  pressure  which  yields  this 
result. 
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Now  let  us  consider  a substructuring  technique  for  the  solution  of  (35). 
We  assume  that  the  interfaces  between  subregions  are  made  up  of  edges  of 

the  triangulation  so  that  these  interfaces  do  not  cut  across  pressure 

triangles.  One  may  easily  arrange  a numbering  scheme  for  the  unknowns  and 
equations  which  yields  a linear  system  of  the  form  (2).  For  example, 
consists  of  all  velocity  unknowns  associated  with  vertices  of  located  in 

the  interior  of  the  subregion  and  all  pressure  unknowns  associated  with 

the  triangles  of  which  are  also  in  ft^.  Note  that  Uq  contains  only 

velocity  unknowns,  namely  those  associated  with  vertices  which  lie  on  the 

interfaces  T ^ but  not  on  3ft. 

We  have  not  constrained  the  pressure  space  and  therefore  the  system  (2) 

corresponding  to  this  discretization  of  (34)  is  singular.  In  fact,  its  rank 

defficiency  is  one,  and  the  null  vector  corresponds  to  the  pressure  function 

which  is  constant  over  ft.  On  the  other  hand,  the  velocity  approximation  is 

uniquely  determined  by  (2)  [6].  Furthermore,  it  is  easy  to  see  that  the 

submatrices  A,,**»,A  are  singular.  In  fact,  these  matrices  are  exactly 
i m 

those  which  arise  from  the  analogous  discretization  of  the  problem. 

Aii  “ grad  p = f_  in  ft_^ 

div  u_  = f in  ft^ 

u = 0 on  8ft^. 

Thus  each  of  the  matrices  A^  has  a single  local  pressure  null  vector,  i.e., 
the  dimension  of  N^  is  one  and  N^  corresponds  to  the  pressure  function 
which  is  constant  over  ft^.  On  the  other  hand,  since  the  velocity  field  can 
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be  uniquely  determined  from  (2)  and  since  Uq  consists  of  only  velocity 
unknowns,  the  matrix  D in  the  linear  system  (15)  is  nonsingular,  i.e., 

Nq  = 0.  Thus,  in  this  case,  the  system  (30)  has  dimension  m and  has  a one- 
dimensional null  space,  the  latter  following  from  the  fact  that  the  system  (2) 
itself  has  a one-dimensional  null  space. 

If  we  choose  the  pressure  space  S*1  to  consist  of  piecewise  linear 
functions  over  the  triangulation  which  are  continuous  over  ft,  while 

retaining  the  same  velocity  space,  the  situation  changes  drastically.  For 
example,  now  the  basis  functions  for  S*1  are  more  easily  associated  with  the 
vertices  of  S^.  Now  contains  pressure  unknowns  corresponding  to 

vertices  in  which  are  in  the  interior  of  ft^  or  lie  on  Sft^PlSft.  More 

important,  Uq  now  contains  pressure  unknowns  associated  with  vertices  of 

which  lie  on  T . . but  not  on  3ft.  In  this  case  the  matrices  Aj  are 

i 3 1 

nonsingular  and  the  matrix  D is  singular  with  a one-dimensional  null  space. 

The  Helmholtz  Equation 

Now  consider  the  problem 

Au  + Xu  = f in  ft 

(36) 

u = 0 on  8ft 

where  X is  not  near  an  eigenvalue  of  the  operator  -A.  Standard  finite 
element  or  finite  difference  discretizations  of  (36)  yield  linear  algebraic 
systems  with  coefficient  matrices  which  are  symmetric  and  indefinite,  but 
which  certainly  may,  by  using  a partial  pivoting  strategy,  be  stably 
inverted.  Now  consider  the  following  specific  situation.  Let  ft  be  the 
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square  (0,it)  x ( 0 , tt ) and  let  X = 13/A.  Since  the  eigenvalues  of  -A  for 

this  region  are  given  by  (n^*  + m ),  m,n  = 1,2,*»*,  we  see  that  X = 13/4  is 

not  an  eigenvalue  and  therefore  the  problem  (36)  leads  to  nonsingular 
coefficient  matrices.  Now,  suppose  we  consider  solving  (36)  by  using  the 

substructuring  algorithm  with  the  two  subregions  Sly  = (0,2tt/3)  x (0,tt)  and 
^2  = (2tt/3,7t)  SI  ( 0 , tt ) • Then  the  matrices  in  (2)  correspond  to  the 

coefficient  matrix  for  the  analogous  discretization  of  the  problem 

Au  + Xu  = f in 

(37) 

u = 0 on 

2 2 

But  the  eigenvalues  of  -A  for  the  region  Sly  are  given  by  (n  + 9m  /4), 
m,n  = 1,2,3,««»,  so  that  X = (13/4)  is  an  eigenvalue  of  -A  for  the  region 
Sly  and  therefore  the  matrix  is  singular  even  though  the  system  (2)  is 

not. 

Admittedly,  this  example  is  somewhat  pathological  in  the  sense  that  for 
random  choices  of  regions,  subregions,  and  parameters  X,  the  probability  is 
zero  that  the  matrices  A^  in  (2)  will  be  singular.  However,  for  particular 
choices  of  X , SI  and  one  or  more  of  the  matrices  A^  may  be  singular; 

after  ^11,  the  above  example  is  not  really  all  that  far-fetched.  Of  course, 
if  any  of  the  A^"s  are  singluar,  the  situation  may  be  remedied  by  choosing  a 
different  subdivision  of  the  region  SI;  this  in  turn  implies  a complete 
reassembly  of  the  coefficient  matrix  in  (2).  On  the  other  hand,  the  algorithm 
of  Sections  2 and  3 may  be  used  whether  or  not  any  of  the  matrices  A^  are 
singluar. 
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There  is  a small  but  nonvanishing  probability  that  for  some  of  the 
problems  (37)  X,  although  not  an  eigenvalue  of  -A  for  the  region  ft^,  is 
close  to  such  an  eigenvalue.  If  X is  close  enough  to  such  an  eigenvalue, 
the  matrix  A^,  in  finite  precision  arithmetic,  may  be  mistakenly  determined 
to  be  singular  by  the  algorithm  of  Section  3.  However,  this  will  be  the  case 
only  when  the  difference  between  X and  an  eigenvalue  is  much  smaller  than 
the  discretization  error,  i.e.,  of  the  order  of  the  unit  roundoff  error  of  the 
machine,  and  no  serious  effect  on  the  accuracy  of  the  solution  should  result. 


Dual  Methods  for  Second-Order  Elliptic  Equations 

For  a third  example,  we  consider  dual  methods  for  second-order  elliptic 
partial  differential  equations.  An  example  of  these  are  methods  based  on  the 
complementary  energy  principle  in  linear  elasticity.  For  simplicity,  we  here 
consider  the  problem 


and 


u = V<f>  in  ft 

div  ii  = f in  ft 

u»ii  =0  on  Tj 

<()  = g on  r9 


(38) 


where  again  = ^ denotes  the  boundary  of  the  bounded  region  ft  Cl? 

and  n denotes  the  unit  outer  normal  to  3ft.  A finite  element  approximation 
of  (38)  may  be  obtained  by  choosing  finite-dimensional  spaces  and  S*1 

and  then  seeking  utVn  and  <J>  €Sn  such  that 
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J(uh-Vh  + ^ div  v^)dfl  = / gv^-tuift  ¥v^ev^ 

a r2 

J ^ div  u1  dft  = J f\f>^ 

a a 


We  assume  that  the  elements  of  V*1  satisfy  the  boundary  condition  on  in 
( 38 ) • The  boundary  condition  on  (j)  is  natural  in  this  formulation,  which  is 
one  of  its  advantages. 

In  [7],  the  following  choice  of  and  S*1  was  shown  to  yield  stable 
and  optimally  accurate  approximations,  at  least  for  polygonal  domains.  First, 
we  subdivide  ft  into  quadrilaterals,  and  then  subdivide  each  quadrilateral 
into  four  triangles  by  drawing  the  diagonals.  For  V*1  we  take  all  continuous 
piecewise  linear  vector  fields  with  respect  to  the  resulting  triangulation  and 
then  define  S*1  = div  V*1.  The  resulting  space  S*1  can  be  shown  to  be  a 
subspace  of  all  piecewise  constants  over  the  triangulation.  See  [7]  for 
details. 


In  the  implementation  of  the  substructuring  algorithm,  we  assume  that  the 


interfaces  T ^ coincide  with  some  of  the  edges  of  the  quadrilaterals  which 
initially  defined  our  finite  element  triangulation  of  ft,  i.e.,  the  interfaces 


do  not  cut  through  any  of  these  quadrilaterals.  The  test  and  trial  functions 
from  V*1  are  associated  with  nodes  while  those  from  S ^ are  associated  with 


the  interior  of  the  quadrilaterals.  The  matrices  in  (2)  now  correspond 

to  the  discretization  of  the  problem 


u = V(f)  and  div  ii  = f in  ft^ 
u;n  =0  on  r^OSft^,  <j>  = g on  r^OBft^^ 


(39) 
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and 


u = 0 on  T , ,09ft . . 

- ij  i 

Because  of  the  last  boundary  condition,  the  problem  (39)  is  over  constrained 
insofar  as  the  variable  n us  concerned.  Nevertheless,  if  = 0, 

i.e.,  a given  subregion  does  not  have  part  of  its  boundary  coincide  with  that 
part  of  9ft  on  which  data  for  <j>  are  given,  then  the  problem  (39)  can  only 
determine  <j>  to  an  additive  constant.  This,  for  example,  would  be  the  case 
for  subregion  ft^  in  Figure  1,  i.e.,  an  interior  subregion.  For  such 

situations,  i.e.,  = 0,  the  matrix  in  (2)  will  again  be  singular, 

with  a one-dimensional  null  space.  Since  (38)  always  uniquely  determines  \x9 
the  matrix  D of  (16)  will  be  nonsingular.  The  rank  deficiency  of  the  system 
(30)  will  be  one  or  zero,  depending  on  whether  or  not  T ^ has  vanishing 
measure,  i.e.,  whether  or  not  the  problem  (38)  uniquely  determines  <j>. 


5.  CONCLUDING  REMARKS 

Determination  fo  Zero  Pivot  Elements 

A crucial  step  in  the  elimination  algorithm  presented  in  Section  3 is  the 
determination  of  when  all  the  elements  in  a column  to  be  eliminated  are 
already  zero.  This  is  necessary  for  the  determination  of  the  null  spaces  of 
the  matrices  and  D.  In  practice  one  would  declare  an  element  to  vanish 

whenever  its  magnitude  is  less  than  some  prescribed  tolerance  which  should  be 
proportional  to  the  unit  roundoff  error  of  the  machine.  This  naturally  leaves 
open  the  possiblity  of  a very  small  but  nonzero  element  being  mistaken  for  a 
vanishing  element.  This  situation  can  be  avoided,  at  least  when  one  is 
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solving  partial  differential  equations,  by  first  using  high  enough  precision 
arithmetic,  e.g.,  60  or  64  bit  floating  point  arithmetic,  and  by  making  sure 
that  the  algorithms  used  are  stable.  The  former  is  easily  arranged,  while  the 
latter  points  out  the  importance  of  rigorous  mathematics.  Indeed,  if  an 
algorithm  is  stable,  as  are  the  ones  discussed  in  Section  4,  and  the  machine 
precision  is  high  enough,  one  should  not  encounter  nonzero  elements  which  are 
comparable  in  magnitude  to  the  unit  roundoff  error  unless  the  matrix  in  hand 
is  singular  or  very  nearly  singular. 

An  alternative  to  the  use  of  elimination  type  procedures  is,  of  course,  to 
employ  methods  based  on  orthogonal  transformations.  At  the  price  of  greater 
computational  expense,  such  methods  are  less  susceptible  to  ill  effects  due  to 
roundoff  error. 


Parallelism 

One  of  the  attractions  of  substructuring  algorithms  is  the  obvious 
inherent  parallelism  both  in  the  assembly  and  solution  stages.  The  sets  of 
matrices  and  vectors  (k±,K±>C±,¥ ±) , i=  l,***,m,  can  each  be  assembled 
independently.  Furthermore,  at  least  in  the  finite  element  case,  we  may  write 
the  matrix  Aq  and  the  vector  Fq  in  the  form 


m 

A°  = ill  AQi’ 


m 


I 

i=l 


0i 


(40) 


where  the  matrix  Aq^  and  the  vector  Fq^  represent  the  contribution  to  the 
matrix  Aq  and  vector  Fq  coming  from  region  0^.  Each  of  the  sets  (Aq^, 
F0i),  i = l,***,m,  may  be  assembled  in  parallel.  Thus,  in  the  assembly  stage, 
the  sets  (A^,B^,C^,F^,Aq^,Fq^) , i = !,•••, m,  may  be  assembled  in  parallel. 
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For  example,  each  of  the  above  sets  may  be  assembled  on  separate  processors, 
with  no  need  for  interprocessor  communications.  At  the  end  of  the  assembly 
process,  the  concatenations  of  (40)  must  be  performed.  This  step  is  not 
parallelizable,  but  represents  a minor  portion  of  the  assembly  process. 

There  is  also  a large  degree  of  parallelism  in  the  solution  algorithm 
described  at  the  beginning  of  Section  3.  Steps  1,  2,  and  3 are  completely 
parallelizable,  again  with  no  interprocessor  communications  necessary. 
Furthermore,  if  the  appropriate  information  can  be  transferred  to  the 
processors,  steps  7,  8,  11,  14,  and  15  and  a portion  of  step  12  can  also  be 
computed  in  parallel.  The  only  relatively  major  steps  which  are  not 
parallelizable  are  steps  5 and  6. 

The  issue  of  parallelism  in  connection  with  substructuring  algorithms  has 
been  studied  in  [8]  in  the  context  of  a specific  three-dimensional  positive 
definite  problem.  That  paper  contains  a discussion  of  operation  counts  which, 
for  the  most  part,  is  also  relevant  in  the  present  context. 

Three-Dimensional  Problems 

As  pointed  out  above,  the  major  nonparallel  steps  in  the  computation  are 
embodied  in  steps  5 and  6 in  the  algorithm  of  Section  3.  Even  on  a serial 
machine  these  steps  may  be  costly  since,  in  general,  they  involve  dense 
matrices.  In  two-dimensional  problems,  by  keeping  the  number  of  subregions 
relatively  small  compared  to  the  total  number  of  elements  in  the 
triangulation,  the  size  of  these  dense  calculations  can  be  kept  small,  i.e., 
the  size  of  D can  be  of  the  order  of  the  square  root  of  the  size  of  the 
A^s.  The  latter  usually  are  sparse,  e.g.,  banded.  A similar  arrangement  in 
three-dimensional  problems  would,  in  general,  lead  to  a matrix  D whose  size 
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is  of  the  order  of  the  two-thirds  power  of  the  size  of  the  Aj^s,  which  may  be 
unacceptably  large.  Furthermore,  in  steps  1 and  2 of  the  algorithm,  the 
number  of  right-hand  sides  would  be  approximately  equal  to  the  number  of 
columns  of  D and  the  size  of  the  A^'s  may  be  too  large,  when  relatively 
few  subregions  are  used.  Therefore,  for  three-dimensional  problems  one  must 
be  especially  careful  to  implement  the  algorithm  in  an  efficient  manner  as 
possible. 

These  potential  difficulties  can  be  mitigated  in  a variety  of  ways.  For 
example,  many  of  the  right-hand  sides  in  the  computations  of  step  1 of  the 
algorithm  are  zero  because  any  column  of  which  corresponds  to  an 
interface  unknown  which  is  not  associated  with  9^  would  vanish.  The 
corresponding  row  of  is  also  zero.  Thus,  one  can  avoid  computations 
involving  linear  systems  with  zero  right-hand  sides  and  multiplications  by 
zero  vectors.  The  savings  possible,  in  storage  and  computing  time,  by 
accounting  for  these  features  are  relatively  higher  for  three-dimensional 
problems. 

Although,  in  general,  the  number  of  interface  variables  may  be  large  for 
three-dimensional  problems,  in  practice  it  is  often  the  case  that  specific 
features  of  the  domain  ft  lead  to  a small  number  of  such  unknowns.  For 
instance,  in  a wing-fuselage  configuration,  it  is  natural  to  consider  the  wing 
and  fuselage  to  be  different  subregions  and  the  interface  between  these  two 
substructures  is  relatively  small  in  extent.  Indeed,  it  was  exactly  in  this 
type  of  application  that  the  terminology  "substructuring"  arose. 

Finally  we  consider  the  most  serious  problem,  namely  that  of  the  size  of 
the  matrix  D.  However,  even  here  a judicious  implementation  can  effect  great 
savings.  As  a simple  illustration  consider  the  subregion  structure  of  Figure 
3 where  we  have  now  labeled  the  interface  boundaries  by  , i = 1 , • • • ,m  - 1 • 
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Figure  3.  An  example  subdivision  of  the  region  ft. 

It  is  natural  to  order  the  interface  unknowns  Ug  one  interface  at  a time, 
e.g.,  first  those  on  F^,  then  those  on  T29  etc.  It  is  not  hard  to  see  that 
the  matrix  D for  this  example  is  block  tridiagonal,  i.e.,  the  unknowns 
corresponding  to  the  interface  T ^ are  connected  only  to  the  unknowns  on  the 
interfaces  and  ri+l*  By  taking  advantage  of  features  such  as 
this,  the  cost  of  step  5 and  6 of  the  algorithm  can  be  greatly  reduced, 
especially  in  three-dimensional  settings.  We  note  that  these  ideas  are 
similar  to  those  connected  with  one-way  direction  algorithms  for  positive 
definite  problems  [9]. 


193 


REFERENCES 


[1]  F.  Gantmacher,  The  Theory  of  Matrices,  Chelsea,  New  York,  1960. 

[2]  G.  Golub  and  D.  Mayers,  "Use  of  preconditioning  over  irregular  regions," 
in  Computer  Methods  in  Applied  Science  and  Engineering,  VI,  (R.  Glowinski 
and  J.  Lions,  Eds.),  1983,  pp.  3-14. 

[3]  M.  Gunzburger  and  R.  Nicolaides,  "Elimination  with  noninvertible  pivots," 
Linear  Algebra  Appl.,  64,  1985,  pp.  183-189. 

[4]  V.  Girault  and  P.-A.  Raviart,  Finite  Element  Approximation  of  the  Navier- 
Stokes  Equations,  Springer,  Berlin,  1979. 

[5]  J.  Boland  and  R.  Nicolaides,  "Stability  of  finite  elements  under 
divergence  constraints,"  SIAM  J.  Numer.  Anal.,  20,  1983,  pp.  722-731. 

[6]  J.  Boland  and  R.  Nicolaides,  "Stable  and  semistable  low  order  finite 
elements  for  viscous  flows,"  SIAM  J.  Numer.  Anal.,  22,  1985,  pp.  474-492. 

[7]  G.  Fix,  M.  Gunzburger,  and  R.  Nicolaides,  "On  mixed  finite  element 
methods  for  first-order  elliptic  systems,"  Numer.  Math.,  37,  1981,  pp. 
29-48. 


194 


[8]  L.  Adams  and  R.  Voigt,  "A  methodology  for  exploiting  parallelism  in  the 
finite  element  process,"  in  Proceedings  of  the  NATO  Workshop  on  High 
Speed  Computations,  (J.  Kowolik,  Ed.),  Springer-Verlag,  Berlin,  1984,  pp. 
373-392. 

[9]  A.  George  and  J.  Liu,  Computer  Solution  of  Large  Sparse  Positive  Definite 
Systems , Prentice  Hall,  Englewood  Cliffs,  New  Jersey,  1981. 


195 


Multiple  Laminar  Flows  Through  Curved  Pipes* 

Zhong-hua  Yang*  and  H.B.  Keller 
Applied  Mathematics,  Caltech,  Pasadena,  CA  91125 


Abstract 

The  Dean  problem  of  steady  viscous  flow  through  a coiled  circular  pipe  is 
studied  numerically  for  a large  range  of  Dean  number  and  for  several  coiling  ratios. 
We  find  that  the  solution  family,  as  parameterized  by  Dean  number,  has  numerous 
folds  or  limit  points.  Four  folds  and  hence  five  branches  of  solutions  are  found. 
We  speculate  that  infinitely  many  solutions  can  exist  in  this  family  for  some  fixed 
value(s)  of  D . More  resolution  and  higher  accuracy  would  be  required  to  justify 
our  conjecture  and  to  find  the  rule  of  formation  of  new  solution  branches. 
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1.  Introduction 


Following  the  early  work  of  Dean  (1927, 1928)  there  have  been  several  numerical 
studies  of  the  steady,  laminar,  viscous  flo\V  of  an  incompressible  fluid  through  a 
slightly  curved  pipe  of  circular  cross  section.  In  particular,  Dennis  (1980)  with 
Collins  (1975)  and  with  Ng  (1982)  have  computed  such  flows  when  the  coiling  ratio 
a/L  is  small.  Here  a is  the  pipe  radius  and  L is  the  radius  of  curvature  of  the 
axis  of  the  pipe.  Also  Van  Dyke  has  applied  the  Stokes  series  and  Dombes-Sykes 
technique  (1978)  to  this  problem.  In  all  of  this  work  the  crucial  parameter  is  the 
Dean  number,  D , defined  as 


D = G a3(^)1/2/fiu  (1.1) 

where  G is  the  constant  pressure  gradient  driving  the  flow,  fi  is  the  viscosity  and 
u is  the  coefficient  of  kinematic  viscosity.  For  small  D and  a/L  « 1 all  of  the 
results  agree. 

In  particular  for  a straight  pipe,  a/L  = 0,  the  flow  is  the  classical  Poiseuille 
flow.  However  a slight  curvature  of  the  pipe  axis  iiiduces  a centrifugal  force  on  the 
fluid  which  then  forms  a secondary  flow,  sending  fluid  outward  along  the  symmetry 
axis  and  returning  along  the  upper  and  lower  curved  surfaces.  Thus  a pair  of 
symmetric  vortices  is  superposed  on  the  Poiseuille  flow.  These  qualitative  features 
are  observed  in  all  of  the  previously  cited  references  for  D small  and  a/L  « 1. 
What  happens  as  D and  a/L  increase?  Few  of  the  previous  studies  consider 
a/L  = 0(1).  Further,  Van  Dyke’s  expansions  disagree  with  the  finite  difference 
calculations  for  larger  values  of  D.  And  in  Dennis  & Ng  (1982)  dual  solutions  are 
found  for  the  range  957.5  < D < 5000  ; that  is  a four  vortex  solution  is  computed 
in  addition  to  the  standard  two  vortex  flow  described  above. 

In  this  paper  we  attempt  to  clarify  the  situation  by  determining  the  structure 
of  the  families  of  solutions  that  exist  as  D varies.  In  addition  we  show  how  this 


structure  changes  as  a/i  increases  (to  0.3).  For  this  purpose  we  must  retain  the  full 
Navier-Stokes  equations  and  do  not  make  the  a/L  « 1 simplifications.  However 
no  dramatic  effects  are  found  as  a/L  increases.  Regarding  the  structure  with 
respect  to  D we  are  not  completely  successful.  Our  results  suggest,  in  analogy  with 
the  von  Karman  swirling  flows  (Lentini  Si  Keller  1980),  that  there  may  be  infinitely 
many  steady  flows  for  some  value  (or  interval)  of  D.  However,  we  have  found  only 
five  branches  of  such  flows  and  believe  that  more  numerical  accuracy  is  required  to 
completely  settle  the  question.  Indeed  our  first,  cruder  calculations  revealed  only 
three  branches  of  solutions.  Unfortunately  the  variation  in  flow  patterns  from  one 
branch  to  the  next  are  not  as  regular  as  those  in  the  von  Karman  swirling  flows, 
so  that  we  cannot  have  the  same  confidence  in  our  current  conjecture.  Also,  we  do 
not  see  analytical  regularities  in  the  five  flows  we  have  detected. 

After  our  study  was  completed  we  learned  of  related  calculations  in  curved 
tubes  by  Winters  and  Brindley  (1984)  and  by  Winters  (1984).  However  that  work 
is  mainly  concerned  with  tubes  of  rectangular  cross  section,  with  a brief  mention  of 
the  circular  case  in  Winters  and  Brindley  (1984).  Bifurcations  are  obtained  for  the 
rectangular  case  but  they  do  not  examine  the  results  we  study  here. 

In  section  2 we  formulate  the  problem  retaining  the  exact  equations  (valid  to  all 
orders  in  e = a/L).  Expansions  in  Fourier  series  are  introduced  in  section  3 to  get  a 
system  of  nonlinear  two-point  boundary  value  problems  for  the  Fourier  coefficients. 
Numerical  methods  are  introduced  in  section  4.  These  employ  centered  differences 
and  Newton’s  method  with  continuation  or  path  following  techniques  introduced 
by  H.B.  Keller  (1977).  The  results  are  presented  and  discussed  in  section  5. 

2.  General  Formulation 

We  employ  the  notation  used  in  Collins  Si  Dennis  (1975)  and  Dennis  Si  Ng 
(1982)  as  indicated  in  Figure  1.  The  circular  cross  section  of  the  tube  in  the  (x,  y)- 
plane  has  radius  a with  center  at  L on  the  x-axis.  The  tube  is  coiled  about  a 
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circle  of  radius  L in  the  (x,  z)-plane.  With  no  pitch  in  the  coil  the  tube  thus  forms 
a torus.  Our  equations  are  exact  for  this  case.  Dimensionless  velocity  components 
of  the  fluid  are  (u,v,u>)  at  a point  P with  dimensionless  polar  coordinates  (r,  o). 
Here  u is  the  radial  and  v is  the  angular  component  of  velocity  in  the  pipe  cross 
section,  w is  the  axial  velocity  normal  to  the  cross  section  and  r = r'/a  where  r'  is 
the  dimensional  radius. 

We  seek  flows  independent  of  6 , the  angular  deviation  from  the  (x,y)-plane. 
A stream  function  <j)(r , a)  is  introduced  in  terms  of  which  the  transverse  velocity 
components  are  given  by: 


u(r,  a)  = 
v(r,  a)  = 


i 21 

r(l  + er  cos  a)  da  ’ 
-1 

(l  + e r cos  a)  dr 


(2.1) 


Here  e = ajL  is  the  “coiling  ratio”  and  the  continuity  equation  is  thus  satisfied. 
Using  these  velocity  components  in  the  Navier-Stokes  equations  we  introduce  the 
modified  Laplacian 


V2  = 


1 + 6 r cos  a r d 


• cos  a r d , r d , / e sin  a d »■  . 

r Ldr'l  + cr  cos  a dr'  da  'l  + cr  cos  a da' J ’ ' 


and  the  vorticity 


n = — v2^ , 


(2.3) 


to  get  for  the  u;-momentum  equation 


V2w  + 


,d<j>  dw  d<j>  dw , _ 

' ^ <**  x-vy  a/  ' * 


r(l  + er  cos  a)  K dr  da  da  dr 


(2.4) 


and  on  elimination  of  the  pressure  from  the  other  momentum  equations: 


v2n  + 


,d<}>  dCl  . d<f> 

An  Qr  /• 


r(l  + tr  cos  a)  \dr  da 

2e  Q / . d<f)  cos  a dtf>^ 

+ 7- rr  sina  — 4-  — — ) 

(1  + e r cos  a)2  ''dr  r da' 

, . dw  cos  a dw  N 
(sma~  + — — ) . 


w 


(1  + e r cos  a)2 


da ' 


(2.5) 


The  equations  used  in  Dennis  (1980)  are  obtained  by  setting  e = 0 in  (2.l)-(2.5) 
(i.c.  they  use  the  small  coiling  ratio  approximation  but  we  do  not). 

Boundary  conditions  on  the  wall  of  the  tube,  r = 1 , yield: 


u;(l,  a)  = 4>{  1,  a)  = — (l,  a)  = 0 , 0 < a < 7r  . 


We  study  here  only  flows  symmetric  about  the  a>axis  for  which: 


(2.6) 


u;(r,  a)  = w(r, -a)  , <f>(r,  a)  = -<£(r,  -a)  , D(r,  a)  = — l“2(r,  -a)  . (2.7) 

Thus  on  the  symmetry  axis  we  have: 


dw  , . dw  , 

57(r’0)=a7(r’,r)  = 0' 
<£(r>°)  = <Kr*i r)  = 0 , 

fl(r, 0)  = fl(r, 7r)  = 0 . 


(2.8) 


3.  Fourier  Series  Expansions 

To  solve  the  boundary  value  problem  posed  in  (2.2)-(2.8)  we  seek  Fourier  ex- 
pansions of  the  stream  function,  axial  velocity  and  vorticity  in  the  forms: 
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oo 

а)  <f>(r,  a)  = ^ . fk{r)  sin  ha  ; 

k= 1- 

CO 

б)  tu(r,  a)  = ^ tWfc(Ocos  ka  ; 

k=0 

CO 

c)  Q(r,  a)  = ^ 0*(r)sin  ka  . (3.1) 

k=l 

With  these  forms  the  symmetry  conditions  (2.7)  and  the  implied  boundary  condi- 
tions (2.8)  are  satisfied. 

Using  the  expansions  (3.1)  in  the  differential  equations  (2.3)-  (2.5)  and  applying 
the  orthogonality  properties  and  other  identities  for  the  trigonometric  functions 
yields  an  infinite  system  of  coupled  nonlinear,  second  order  ordinary  differential 
equations  for  the  coefficient  functions  {fk(r),  tUfc(r),  <7fc(r)}  . Specifically  we  get 
from  (2.3),  with  the  notation  fo{r)  = <?o(r)  = 0 : 


ct  er 

= ~ ffk-i(r)  - ffk(r)  - — gk+i{r)  , k > 1 . 


(3.2) 


From  (2.4)  we  get,  with  tu_i(r)  = 0: 


er  r d 2 

*2  L IP* 


(k  — 1)  ( k 2) ) / \ . [ ^ 

^ J Wk-'{T>  +[d^+rd~r 


er  r d2 

+ 7 


{k  + l){k  + 2) 


] «0c+i(r) 


= Rk{r)  - 6ktle  r D - 6k<0D  , k>  0. 


(3.3) 
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From  (2.5)  we  get,  with  g~i{r)  = 0: 


(fc  — 2)(fc  — 3)l  ir\^<P  ,1  d (k  - l)(2k  - 3)  1 

(2J  I*’ 75 Ks<r>  + 7r*5  + 7 * 75 


f rj^_  1 _d fc^i  e2r2  d2  fc21 

\ Ldr2  r dr  r2J  2 -dr2  r2 . 


9k{r) 


*f 1 *£  *;  i - * <$*[ i - 

er  _ , . _ , . er  e 


- -T  Sk-i{r)  + Sk{r)  + ^ £*+i(r)  + Pk(r ) + ^ Qjt(r)  , /c  > 1 . 


(3.4) 


We  have  used  the  Kronecker  symbol  Sit)-  and  introduced  the  quantities  Rk,  Sk, 
Pk  and  Qk  as: 


a)  i2fc(r)  = 


_ (1  + frc.o)  1 


2r 


OO  / 

1 [ln“  *1  / |n— *1  (r)  + (n  + k)fn+k[r)  t^n(r) 
n=0  ^ ^ 


+ n[/n+*  + (n“fc)/|n-*|(r)]^n(r)| 

i 00  r 

b)  Sk(r)  — — | /|»-*|(r)  “ (»  + *)/«+*(»■)]  0^(r) 


- *[fn+k-  sign  (n-/:)/,,n^,(r)]<7ft(r)| 

1 00  , 

c)  Pk(r)  = - 55  | [w»(r)  - “wn(r)j  [(1  + <5*,n+l)u>|n+l--*|(r)  “ Wn+l+k(r)] 

“ [wn(r)  + - t»«(r)]  [(1  + £»-i,k)w|„_i_*|(r)  - Wn-i+k(r)  | 

d)  Qk{r ) = 55  [ |/n(r)  “ ^ /«(»•)]  [?n+i+^(r)  - sign  (n  + 1 - %|n+1_fc|(r)j 

” [/»(r)  + - /n(r)]  [sn-i+*(r)  - sign  (n  - 1 - %|n-i_k|(r)]  J 

(3.5) 

At  the  origin,  r — 0 , of  the  polar  coordinates  (r,  a)  continuity  requires  that 
<£(0,a)  , u>(0,a)  and  ft(0, a)  be  independent  of  a . From  (3.1)  we  thus  get  that: 


/k(0)  = wk(0)  = ffk(0)  = 0 , k = 1,2,.... 


(3.6) 
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Note  that  a condition  on  tuo(O)  is  not  obtained  but  iuo(0)  = tu(0,  a)  . The 
conditions  (2.6)  at  r = 1 applied  to  (3.1a,b)  yield: 


a)  fk{  1)=0,  fc  = 1,2,... 

b)  f'k{  l)=0,  k = 1,2,... 

c)  tujfe(l)  = 0 , * = 0,1,2,...  (3.7) 

The  formal  consistency  of  “order”  of  the  system  and  number  of  boundary 
conditions  seems  to  be  off  by  one  since  all  of  the  equations  are  second  order  and  we 
do  not  have  two  boundary  conditions  on  io0(r)  • This  is  easily  remedied  by  noting 
that  the  equation  in  (3.3)  for  k = 0 can  be  reduced  to  a first  order  equation.  To 
do  this  we  multiply  by  r and  integrate  over  [0,  r]  . In  evaluating  at  r = 0 we  use 
(3.6)  and  the  assumptions  that: 

Inn  [r  w'0  (r)]  = lim  [r2u>i(r)]  = 0 . 

The  result  is  the  first  order  equation: 

wo(r ) + j [~  toi(r)  - 2-  «>i(r)]  = — 2 n w“(r)  “ \ D • (3*4 * * * 8) 

n=l 

The  analytical  problem  is  thus  reduced  to  solving  (3.2)  for  k > 1 , (3.3)  for  k > 1 , 
(3.4)  for  k > 1 and  (3.8)  subject  to  the  boundary  conditions  (3.6)  and  (3.7). 

4.  Numerical  Procedures 

To  solve  or  rather  to  approximate  the  solution  of  the  problem  formulated  in 
Section  3 we  first  truncate  the  Fourier  expansions,  we  then  use  difference  approxima- 

tions on  the  resulting  system  of  O.D.E.s  and  finally  we  solve  the  nonlinear  difference 

equations  by  means  of  Newton’s  method  and  continuation  procedures.  We  describe 

these  techniques  below. 


A.  Truncation  of  the  Fourier  Expansions 


Under  the  assumption  that  the  series  in  (3.1)  converge  sufficiently  rapidly  we 
replace  them  by  the  finite  trigonometric  expansions  obtained  by  setting 


fk[r)  = Wk(r)  = gk(r)  = 0 , k>K.  (4.1a) 

When  we  use  (4.1)  in  the  equations  (3.2)-(3.8)  we  obtain  a system  of  3 K second 
order  and  one  first  order  ordinary  differential  equations  for  the  3 K + 1 quantities: 


/*(»•).  9k{r)  , 1 <k<K;  wk{r)  , 0 <k<K.  (4.1  b) 

there  are  6K  + 1 boundary  conditions  in  (3.6)  and  (3.7)  when  we  terminate  those 
relations  at  k = K . We  seek  to  solve  this  two-point  boundary  value  problem 
numerically. 

B.  Difference  Approximations. 

We  place  a uniform  grid  of  points  ry  = jh  , 0.<  j < M + 1 with  rM+1  = 1 on 
the  interval  0 < r < 1 . At  each  point  of  this  grid  we  introduce  approximations  to 
the  coefficients  in  (4.1b)  with  the  notation 


fk  (ry)  = fk,j  , 9k  (ry)  = gkj  , wk  (ry)  = wkJ 
We  employ  the  difference  operators,  for  any  mesh  function,  say  uy  : 


D* ^-=—u  -<  Do 


2 h 


Then  the  discrete  or  difference  approximations  to  (3.2),  (3.3)  and  (3.4)  are  taken 
to  be: 
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£i  [D+D.-{k  1]ik  2)]  h-u  + [d+D.  + ±O0  - £]  /« 

/ L r y j * L rj  rj  J 

cry  r (fc  + l)(fc  + 2)l  _ ’cry  cry 

+ ~ D+D-  ^2  j/fc+i,j—  2 9k-i,j  2 ^fc+1»J’ 


b+x>-  - (<:  2)]  + [r>+i>-  + f Co  - « 

/ l “y  J *■  ry  ry J 

cry  f (/c  + l)  (/c  + 2)  l 

+—  [£>+£>- ^2 J ~ ^,icry 


cry\2  [p  p (fc-2)(fc-3)i  cry  r 1 (fc  - 1) (2*  - 3)  1 . 

— J D+D ^ J 9k- 2,3  + ~Y  [2 D+D-  + — Do  ^ J 9k-i,s 


+ \D+D 


1 fc2i  c2r?  r ^2ll 

' + v + “ [D+D- - 7j\j  9tJ 


, cr  Ln  r , 1 P {k  + l){2k  + 3)l  /cry 

+y  [2P+D-  + - Do ^ J gk'+ij  + [~y 


O’  b+c.  - {-  + 2)f  + 3) 


J Qk+ZJ 


- ~Y  Sk- 1,3  + Sk+l,3  + ~Y  Sk  + 1,3  + Pk,3  + 2 5 (4-4) 

Each  of  these  difference  equations  is  imposed  for 

j = 1,2,...,M  , 

7c  = 1,2,...,  if  . 

The  quantities  .R^y  , £*ty  , Pfc(y  , and  Q^y  are  the  obvious  finite  difference 
approximations  to  the  quantities  in  (3.5)  centered  at  ry  . Since  only  first 

derivatives  occur  in  these  expressions  we  employ  Downyj  to  approximate 
<(ry)  , etc.  The  remaining  first  order  equation  (3.8)  is  centered  at  the  points 
ry_i  = (j  — -|)/i  as  follows: 


L J—  2 J 

_ l n^hhi  + l Ihizl'j  ^Wn'i  + Wn>3-i 

^^“3  n=l  ^ ^ 


2 /Wi,y  + U>i,y_i 


ry-|  7 > (4*5) 
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for 


3 — 1>2, . . . ,*M  + 1 . 

The  boundary  conditions  (3.6)  and  (3.7a,c)  go  over  into  the  corresponding 
conditions: 

a)  fk,o  = Wk,o  = 9k, o = 0 , k = ; 

&)  fk.M+i  ~ wk,M+i  ~ 0 > k = 1,2,...  ,K  ; tyo,M+i  = 0 . (4.6) 

The  remaining  conditions,  in  (3.7b),  are  imposed  in  order  to  retain  second  order 
accuracy  as: 


n / - / k,M +2  ~ /fc.M  „ , , „ 

vo  //c,Af+i  = 2/^ = 0 > k = l,2,...,K. 

Of  course  the  meshpoint  r^+2  is  not  in  [0, 1]  and  so  the  values  fk,M+‘2  seem 
extraneous.  However  they  are  eliminated  by  imposing  the  difference  equations  in 
(4.2)  at  j = M + 1 . The  result,  after  using  (4.6b)  and  the  above,  is  for  c = 0 : 


QktM +1  ^2  f k,M  > k 1,2, ...  , K • 


(4.7) 


For  e > 0 we  must  add  the  terms: 


“ [s*-l,M+l  +S*+i,M+l  + £>+£>_  {fk-l,M  + fk+l,M)] 

The  numerical  problem  is  to  solve  the  nonlinear  system  of  difference  equations 
in  (4.2),  (4.3),  (4.4),  (4.5)  and  (4.7).  These  form  3KM  + K + M + 1 equations. 
There  are  precisely  that  many  unknowns  {fkj  , wklJ-  , gkJ  } when  the  quantities 
in  (4.6)  are  eliminated.  We  go  further  and  use  (4.7)  to  eliminate  the  K quantities 
{Pfc.M+i}  • Then  we  have  only  {ZK  + 1 + 1 equations  and  unknowns. 
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C.  Newton's  Method  and  Continuation. 


To  solve  the  difference  equations  we  use  Newton’s  method  combined  with  con- 
tinuation procedures  to  insure  good  initial. estimates  of  the  solution  as  the  param- 
eters are  varied.  To  do  this  efficiently  the  unknowns  must  be  ordered  in  a manner 
that  simplifies  the  structure  of  the  Jacobian  matrix.  To  describe  our  ordering  we 
first  introduce  the  vectors  / . , g , and  w . of  dimensions  K,  K and  K + 1 , 
respectively,  by: 


Cj  = /2,y»  • • • » SkJ)  » 

gj  = {9ij > 02,/,  • • • , 9k j ) , 1 < j < M + 1 ; 

u; T = (u>o,y,  wltj, . . . , wjf.y)  , 1 < j < M . (4.8) 

Recall  that  (4.7)  gives:  <7  = — 1 3 / (for  the  case  c = 0)  and  so  0 can 

be  eliminated.  The  remaining  (3 K -f  l)M  + 1 unknowns  are  represented  in  the 
vector  X defined  by: 


X = (w0,o;  , g^, 


(4.9) 


Now  we  order  the  equations  in  a corresponding  manner.  That  is  for  a fixed  j- 
value  (:.c.  meshpoint)  we  take  (4.5)  and  all  of  (4.2),  (4.3)  and  (4.4)  for  1 < k < K . 
The  equations  ordered  in  this  manner  for  j = 1,2,...,M  and  finally  (4.5)  for 
j = M + 1 can  be  written  as  a vector  equation  in  the  form 


G(X;D,e)  = 0.  (4.10) 

Here  G has  (3 K + 1)  components,  each  being  one  of  the  difference  equations.  We 
have  indicated  the  dependence  of  these  equations  on  the  parameters  D and  e as  they 
play  a special  role  in  the  continuation  procedures.  For  a fixed  value  of  D and  e we 


denote  a solution  of  (4.10)  by  X = X(D,  e)  . When  D — 0 and  e = 0 an 
exact  solution  of  the  continuous  problem  is  given  by  Poiseuille  flow.  Thus  we  easily 
get  a solution  of  the  discrete  problem  in  this  case.  As  D or  e deviates  from 
zero  we  can  use  the  Poiseuille  flow  as  an  initial  estimate  of  the  discrete  solution  in 
Newton’s  method  applied  to  the  system  (4.10).  This  gives  a sequence  of  iterates 
{X^  (D,  c) } defined  by: 

a)  X^  ( D , e)  = initial  estimate  , 

b ) SX(XM;D,£)  [X|rtl)-XW]  = -G(XM;I>,£),  ■/  = <>, 1,2 (4.11) 

Here  Gyr  is  the  Jacobian  matrix  which  as  a result  of  the  above  indicated  ordering 

(V 

has  the  block-band  structure  indicated  below.  Each  square  block  is  a matrix  of 
order  (3 K + l)  x (3 K + ll  . There  are  M 


such  rows  of  blocks.  This  array  of  blocks  is  bordered  by  one  row  and  column  os 
shown.  All  other  elements  in  Gyr  are  zero.  Most  of  the  computing  effort  goes  into 
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solving  the  linear  algebraic  systems  in  (4.11b).  Thus  to  reduce  the  number  of  times 
this  must  be  done  we  seek  accurate  initial  estimates. 

One  way  to  obtain  good  initial  estimates  is  to  use  two  terms  in  a Taylor  ex- 
pansion of  the  solution  with  respect  to  changes  in  the  parameter  D , say.  Thus  we 
use: 


X(0]  {D  + 6D , e)  = X(D , e)  + 6DXd  ( D , e)  (4.12a) 

To  obtain  XD  we  note,  from  (4.10),  that  it  satisfies: 


Gx(X{D,e);D,e)XD  = -gc(X(£,£);B,£)  (4.126) 

This  system  is  similar  to  those  in  (4.11b).  In  fact  when  Newton’s  method  has  con- 
verged, the  last  time  we  solve  (4.11b)  we  can  also  solve  (4.12b)  and  thus  Xd(D , e)  is 
determined  with  little  extra  work  (t.e.  only  the  backsolves  and  evaluation  of 
Gd  need  be  done).  Continuation  with  respect  to  e can  be  done  in  an  exactly 
similar  manner. 

The  method  described  in  (4.11),  (4.12)  is  known  as  Euler-Newton  continuation. 
It  is  extremely  effective  and  usually  converges  quadratically.  There  are  many  refine- 
ments regarding  step  length  procedures,  efficient  solution  of  the  block-banded  linear 
systems,  approximation  of  Jacobians,  etc.,  which  we  do  not  discuss  here.  Failure  of 
the  method  to  converge  does  occur,  however,  and  it  usually  signals  the  presence  of 
a bifurcation  or  fold  point  on  the  solution  path  (or  family)  being  generated.  Such 
points  or  solutions  are  called  singular  because  the  Jacobian  matrix  evaluated  at 
these  solutions  is  singular.  Almost  all  such  singular  points  are  what  we  call  simple 
folds  or  limit  points.  In  particular  a simple  fold  with  respect  to  D is  a singular 
solution,  say  [X0,  Dq , €q]»  which  has  the  properties  that: 
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a)  dim  JV(G^)  = 1 ; (i.e.,  all  solutions  of 

G°x<f>  = 0 are  <f>  = a <f>Q  ,•  a € dt  , some  <£q  ± 0) 

(4.13) 

b)  Gp  ^(£x)  (*-e-  ( Qd >V>)  7^  0 for  all  solutions  of 

«$)T*-o). 

Here  G^  = Qx{Xq]  D0,  e0)  and  = GD(XQ]D0,e 0)  • All  of  the  singular 
solutions  we  have  found  in  this  work  have  been  such  simple  fold  points.  We  have 
sought  bifurcation  points  but  have  found  none. 

It  is  not  difficult  to  circumvent  the  convergence  problems  near  fold  points.  We 
do  this  by  using  pseudo-arclength  continuation  as  introduced  in  Keller  (1977).  That 
is,  we  do  not  parametrize  the  solution  path  or  family  by  D (as  we  assume  has  been 
done  above)  but  rather  introduce  a new  parameter  s and  a new  scalar  constraint 
and  seek  to  solve  the  inflated  or  augmented  system: 


а)  G(X{s),D{s),e)  =0 

б)  /&(«„),  [x(s)-X(So)]) 

+i>(so)  \d(s)  - D(^)]  + (s-  so)  = 0 (4.14) 

Here  [£(<<>),  i>(*o)]  is  a previously  computed  solution  for  e fixed  in  the  present 

dX 

discussion  and  for  s = so  • By  X = and  D = ^ we  denote  the  components  of 
a tangent  vector  to  the  solution  path  {X(s),P(s)}.  The  constraint  (4.14b)  simply 
requires  that  the  point  [X(s),P(s)j  lie  on  the  plane  normal  to  this  tangent  at  a 
distance  (s  — so)  from  the  point  of  tangency. 

We  use  the  scheme  (4.14)  when  the  previous  Euler-Newton  scheme  begins  to 
show  signs  of  failure  (z.-e.  too  many  iterations  till  convergence).  We  solve  (4.14)  by 
Newton’s  method.  The  Jacobian  of  this  system  is 
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a(G,AQ 

3[X,D) 


(4.15) 


(Qx 

1%  N°) 

This  Jacobian  is  nonsingular  at  regular  solution  points  and  at  simple  fold  points. 
That  is  why  our  method  has  no  difficulties  in  computing  solution  paths  through 
folds.  To  solve  for  the  Newton  iterates  we  use  the  Bordering  Algorithm  described 
in  Keller  (1977)  which  is  designed  for  systems  with  coefficients  as  in  (4.15). 

By  differentiating  in  (4.14a)  with  respect  to  s we  find  that  [X(s),Z)(s)]  , the 
tangent  to  the  solution  path,  satisfies: 

GxX(s)  + GdD[s)  = 0 (4.16a) 

To  solve  this  we  first  solve 


and  then  set 


(4.166) 


X(s)  = D[s ) £(s)  (4.16c) 

However  since  the  scale  of  s has  not  been  determined  we  choose  it  to  represent 
(local)  arclength  along  the  solution  path.  Thus  we  require  that 

(X(s),X(5))  + P2(5)  = l 

and  using  (4.16c)  in  the  above  we  get 

£>(s)  = ±(yfl  -<£,£>)_1  (4.16(f) 

The  sign  here  is  chosen  so  that  (X(s),  X(sq))  > 0 which  determines  the  orientation 
along  the  solution  path. 
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We  determine  a new  tangent  only  after  having  solved  (4.14).  Then  we  replace 
[X(s0),P(>so)]  by  the  new  tangent  [X(s),  jD(s)]  and  proceed  as  before. 

5.  Results  of  Calculations 

In  addition  to  the  stream  function  and  axial  flow  velocity  we  have  computed 
Re  , the  Reynolds  number  based  on  the  mean  axial  velocity: 

Re  = 2y/2  f 1 u0(r)rdr  ; 

Jo 

and  the  friction  ratio  (ratio  of  curved,  7C  > to  straight,  7*  , wall  friction): 


- = 4'/57T 
7«  D 

We  have  computed  solution  paths  with  D varying  for  the  following  sets  of 
values  of  Fourier  truncation,  K , mesh  spacing,  h , and  coiling  ratio,  c: 


I. 

* 

II 

0 

h=  — ; 
40  ’ 

c = 0; 

II. 

II 

h-l 

0 

h=  60  ; 

£ = 0, 

•e  = 0.1  ; 

III. 

if  = 20  , 

e = 0, 

£ = 0.1  , £ = 0.2 

Starting  from  the  trivial  state  with  u = v = w = 0 for  e = 0 and  D = 0 we  used 
continuation  with  D increasing  as  described  in  Section  4.  In  each  of  the  three 
cases  a simple  fold  was  found  and  arclength  continuation  was  used  to  accurately 
locate  the  fold  and  to  traverse  it.  The  solution  branches  were  then  continued  with 
decreasing  D and,  in  each  case,  another  fold  was  found.  Again  these  folds  were 
located  accurately  and  traversed  to  obtain  a third  branch  in  each  of  the  three  cases, 
now  with  D increasing.  For  cases  I and  II,  extensions  of  these  third  branches 
continued  well  beyond  where  we  could  trust  the  numerical  results.  However  for 
case  III  a third  and  fourth  fold  were  found,  leading  to  five  branches  of  solutions. 
In  Table  1 we  list  the  critical  value  of  the  Dean  number,  Dm  , at  the  m-th  fold. 
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For  cases  II  and  III  the  fold  solutions  found  for  e = 0 were  continued  in  e up  to 
0.1  and  for  case  III  the  continuation  went  up  to  e = 0.2  . These  results  are  also 
given  in  Table  1. 

We  call  the  family  of  solutions  varying  continuously  with  D in  Dm-i  < D < 
Dm  the  “m-th  branch”  ( Dq  = 0).  Our  calculations  seem  to  suggest  that  the 
analytic  problem  has  infinitely  many  branches  although  we  have  computed  only 
five  of  them.  Graphs  of  ^c/l»  vs  D are  given  for  cases  I and  III  in  Figures  2 and 
3,  respectively.  On  the  first  branch,  that  emanating  from  D = 0 , the  solutions 
are  of  the  classic  form  described  by  Dean  — we  call  these  “two-vortex”  flows  (see 
Figure  4).  These  two-vortex  flows  persist  on  the  entire  first  branch  and  over  most 
of  the  second  branch  down  (in  D values)  to  about  D ~ 5000  where  four-vortex 
solutions  gradually  appear.  These  four-vortex  flows  are  formed  in  the  calculations 
by  the  development,  as  D decreases  on  the  second  branch,  of  a small  weak  pair  of 
vortices  about  the  axis  of  symmetry  near  the  outer  edge  of  the  tube.  This  vortex 
pair  grows  as  D decreases  and  persists  onto  the  third  branch  as  D then  increases 
(see  Figure  5).  The  four-vortex  flows  remain  on  the  entire  third  branch  and  onto  the 
fourth  branch  down  to  D k 14, 000  where  six-vortex  flows  appear.  We  believe  that, 
as  this  process  continues,  2n-vortex  flows  can  form  for  all  n = 1, 2, . . . . Indeed  on 
the  fifth  branch  we  have  computed  8-vortex  solutions  at  D « 25,000  (see  Figure 
9). 

In  Table  2 we  compare  our  computed  values  of  ^c/l « on  the  first  branch  with 
various  values  reported  in  the  literature  (for  two- vortex  flows).  The  agreement  is 
quite  good.  Dennis  and  Ng  (1982)  have  also  obtained  four-vortex  solutions  over 
957.5  < D < 5000  . We  claim  that  these  solutions  are  on  the  third  branch.  They 
were  obtained  accidentally  in  Dennis  and  Ng  (1982)  as  a result  of  convergence 
difficulties  with  increasing  D values  near  5000  . Then  as  D was  decreased  the 
solution  “jumped”  back  onto  the  first  branch.  This  is  typical  of  the  behavior  to  be 


expected  near  folds  if  no  special  technique  for  traversing  them  is  used.  Thus  the 
intermediate  second  branch  was  not  obtained  in  Dennis  and  Ng  (1982).  In  Table 
3 we  compare  the  values  of  the  four-vortex  solutions  obtained  in  Dennis  and  Ng 
(1982)  with  our  values  on  the  third  branch.  The  agreement  leaves  no  doubt  as  to 
the  identity  of  the  two  results.  The  somewhat  larger  discrepancies  at  D = 5000  is 
due,  we  believe,  to  inaccuracies  in  Dennis  and  Ng  where  convergence  difficulties 
occurred.  Graphs  of  the  stream  function  and  axial  velocity  contour  lines  on  the 
third  branch  also  agree  well  with  those  in  Dennis  and  Ng. 

Over  the  interval  D4  < D < D$  we  have  obtained  five  solution  branches.  To 
give  some  idea  of  how  the  solutions  change  we  show  in  Figures  4-8  plots  of  contour 
lines  of  the  stream  function  and  axial  velocity  for  the  solution  with  D — 8000  on 
each  of  the  five  branches.  In  addition  we  display  in  Figure  9 the  results  for  D = 
25,000  on  the  fifth  branch.  The  contour  lines  in  each  figure  are  at  levels  that  differ 
by  one  tenth  the  value  between  maximum  and  minimum  values  of  the  quantity 
plotted.  The  values  of  these  maxima  and  minima  are  given  with  each  figure.  The 
small  closed  contours  (or  almost  points)  near  the  maxima  or  minima  are  at  the 
levels  of  0.995  or  1.005  , respectively,  of  the  critical  values. 

Least  squares  fits  of  the  ryc/o,s  vs  D curves  with  e = 0 have  been  made  in 
the  form 


— = am  + bm  D1/3 


On  branches  m = 1, 2 and  5 we  get  the  coefficient  values: 


ai  = 0.3  , a 2 = 0.25  , Gg  = 0.15  and  6i  = 62  = 65  = 1/8  . 

Other  exponents  have  been  used  but  the  1/3  power  seems  to  fit  the  data  best. 
It  is  not  clear,  in  light  of  the  multiplicity  of  solutions  and  the  unsettled  nature 
of  the  solutions  for  large  D , what  the  significance  of  “asymptotic  solutions”  for 
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D * oo  can  be.  Thus  we  do  not  address  this  problem  here  but  merely  present  the 
above  fits  for  whatever  use  they  may  be. 

During  the  course  of  this  work  we  have  benefitted  from  conversations  with 
Prof.  A.  Acrivos.  We  also  wish  to  thank  Prof.  S.C.R.  Dennis  who  first  brought  the 
matter  of  multiple  solutions  to  our  attention  and  suggested  that  we  work  on  it. 
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Table  and  Figure  Captions 


Table  1.  Critical  Dean  number,  Dm  , at’the  m-th  fold  in  the  solution  branches. 

Table  2.  Comparison  of  7c/7s  on  the  two-vortex  solutions  of  various  works  with  the 
present  solutions  on  the  first  branch. 

Table  3.  Comparison  of  the  four-vortex  solutions  of  Dennis  and  Ng  (1982)  with  the 
present  solutions  on  the  third  branch. 


Figure  1.  The  tube  cross-sections  showing  coordinates,  velocity  components,  axial  flow 
distribution  sketch  and  cross-flow  streamlines  sketch. 

Figure  2.  Friction  ratio,  7c/7s  , vs.  Dean  number,  D , for  case  I:  K = 10, 

h = 1/40,  e = 0 . 

Figure  3.  Friction  ratio,  7c/7s  , vs • Dean  number,  D , for  case  III:  K = 20, 
h — l/60,  e — 0 . 

Figure  4.  Axial  velocity,  w , and  stream  function,  4>  , contour  lines:  D = 8000, 

K = 20,  h = 1/60,  e = 0.  First  branch:  Max  w = 0,  Min  tv  = 0 , 

Max  4>  = 23.986,  Min  0 = 0. 

Figure  5.  Same  as  in  Fig.  4.  Second  Branch:  Max  tv  = 625.956,  Min  tv  = 0 , 

Max  0 = 23.497,  Min  0 = 0 . 

Figure  6.  Same  as  in  Fig.  4.  Third  Branch:  Max  tv  = 594.777,  Min  tv  = 0 , 

Max  0 = 22.962,  Min  0 = -12.897  . 
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Figure  7.  Same  as  in  Fig.  4.  Fourth  Branch:  Max  tv  .=  613.697,  Min  tv  = 0 , 
Max  <f>  = 21.783,  Min  <t>  = -8. 71 6 

Figure  8.  Same  as  in  Fig.  4.  Fifth  Branch:  Max  tv  = 622.831,  Min  tv  = 0 , 

Max  <fi  = 20.679,  Min  <j>  = -4.676  . 

Figure  9.  Axial  velocity,  tv  , and  stream  function,  <f>  , contour  lines:  D = 25, 000, 

K = 20,  h = 1/60,  e = 0.  Fifth  branch:  Max  tv  = 1412.730,  Min  tv  = 0 , 
Max  <f>  = 31.494,  Min  <f>  = -14.335  . 
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ABSTRACT 

The  term  "vortex  breakdown"  refers  to  the  abrupt  and  drastic  changes  of 
structure  that  can  sometimes  occur  in  swirling  flows.  It  has  been  conjectured  that 
the  "bubble"  type  of  breakdown  can  be  viewed  as  an  axisymmetric  wave  travel- 
ling upstream  in  a primarily  columnar  vortex  flow.  In  this  scenario  the  wave’s 
upstream  progress  is  impeded  only  when  it  reaches  a critical  amplitude  and  it 
loses  stability  to  some  non- axisymmetric  disturbance.  We  will  investigate  the  sta- 
bility of  some  axisymmetric  wavy  flows,  which  model  vortex  breakdown,  to  three 
dimensional  disturbances  viewing  the  amplitude  of  the  wave  as  a bifurcation 
parameter.  We  will  also  look  at  the  stability  of  a set  of  related,  columnar  vortex 
flows  which  are  constructed  by  taking  the  two  dimensional  flow  at  a single  axial 
location  and  extending  it  throughout  the  domain  without  variation.  The  method 
of  our  investigation  will  be  to  expand  the  perturbation  velocity  in  a series  of  diver- 
gence free  vectors  which  ensures  that  the  continuity  equation  for  the  incompressi- 
ble fluid  is  satisfied  exactly  by  the  computed  velocity  field.  Projections  of  the  sta- 
bility equation  onto  the  space  of  inviscid  vector  fields  eliminates  the  pressure  term 
from  the  equation  and  reduces  the  differential  eigen  problem  to  a generalized 
matrix  eigen  problem.  Results  are  presented  both  for  the  one  dimensional,  colum- 
nar vortex  flows  and  also  for  the  wavy  "bubble"  flows. 
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1.  Introduction:  Vortex  Breakdown. 


The  term  "vortex  breakdown"  refers  to  the  abrupt  and  drastic  changes  of  structure  that  can 
sometimes  occur  in  vortex  flows.  Observations  by  Peckham  & Atkinson  [1957]  of  breakdowns 
occurring  in  the  leading  edge  vortex  formed  above  a swept  back  lifting  surface  and  a number  of 
studies  demonstrating  the  serious  aerodynamic  consequences  of  such  events  (the  slopes  of  the  lift, 
drag  and  moment  curves  are  all  altered  by  breakdown)  stimulated  early  interest  in  the  subject. 
Since  that  time  the  literature  on  vortex  breakdown  has  burgeoned.  The  interested  reader  is 
referred  to  review  articles  by  Hall  [1972]  and  Leibovich  [1978,  1984]  for  summaries  both  of  the 
experimental  observations  that  have  been  made  and  also  of  the  theories  that  have  been  proposed 
to  explain  them. 

Experimental  observations  are  most  easily  made  on  vortex  flows  confined  to  tubes  and  the 
bulk  of  the  available  data  is  for  such  cases.  In  one  apparatus,  used  by  a number  of  researchers, 
water  is  passed  radially  inward  through  a set  of  guidevanes  imparting  swirl  to  the  fluid  which 
then  enters  axially  into  a test  section  (a  frustrum  of  a cone  of  very  small  cone  angle),  by  means  of 
an  annular  channel  formed  between  a bellmouth  opening  on  the  section  and  a centerbody  whose 
tip  is  aligned  with  the  cone  axis.  The  boundary  layer  shed  from  the  tip  of  the  centerbody  forms  a 
well  defined  vortex  core  along  the  axis  of  the  test  section  and  dye  injected  through  the  tip  allows 
for  flow  visualization. 

With  this  type  af  apparatus  two  parameters  are  within  the  easy  control  of  the  experimen- 
talist, namely  the  amount  of  swirl  imparted  to  the  inlet  flow  and  the  volume  flow  rate  through 
the  tube  (effectively  the  Reynolds  number  of  the  flow).  As  the  Reynolds  number  is  increased  for 
a sufficently  large,  fixed  value  of  swirl  the  breakdown  assumes  one  of  two  characteristic  forms. 
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Both  of  these  are  characterized  by  a rapid  deceleration  of  the  axial  velocity  component,  occurring 
in  the  axial  distance  on  the  order  of  one  vortex  core  diameter,  followed  by  the  formation  of  a 
stagnation  point  and  (in  some  frame  of  reference)  a region  of  reversed  flow  along  the  axis.  The 
two  forms  are  easily  distinguished  in  flow  visualization  studies  as  in  one  form,  the  spiral  or  S type 
breakdown,  the  tracer  dye  assumes  a spiral  shape  rotating  in  the  same  sense  as  the  inlet  fluid, 
while  in  the  other  form,  the  bubble  or  B type  breakdown,  the  dye  assumes  a form  that  looks 
much  like  a body  of  revolution  placed  in  the  fluid.  Our  interest  will  be  in  this  latter  form  of 
breakdown  which  is  sketched  in  Figure  1.  Here  we  show  "ideal"  or  averaged  stream  surfaces  on 
which  the  fluid  particles  travel  in  helical  paths.  (Leibovich  [1978]). 

Faler  & Leibovich  [1977],  Garg  & Leibovich  [1979]  and  the  author  [unpublished  studies] 
have  used  the  non-invasive  techniques  of  laser  doppler  anemometry  to  measure  the  velocity  fields 
both  upstream  and  downstream  of  breakdown  events.  Outside  a thin  boundary  layer  along  the 
tube  wall  the  experimental  data  is  well  fitted  by  the  analytic  profiles, 


no  - 7«(  i - 

(1.1) 

W(r ) = W1  + W2e~ari 

(1.2) 

W and  V being  respectively  the  axial  and  azimuthal  velocity  components  while  Wv  W2 , Q and  a 
are  all  constants  (representative  values  are  given  by  Garg  & Leibovich  [1979]).  The  profiles  apply 
to  the  downstream  flow  only  in  the  mean,  as  the  flow  there  fluctuates  with  time. 

Upstream  of  the  recirculation  zone  the  flow  is  axisymmetric  and  steady.  After  breakdown 
the  vortex  core  expands  to  two  or  three  times  its  upstream  size  and  the  constant  W2  in  the  mean 
axial  velocity  profile  which  had  been  positive  upstream  (jetlike  flow)  becomes  negative  (wakelike 
flow).  Downstream,  within  a few  vortex  core  diameters  of  the  breakdown,  a turbulent  wake  is 


invariably  established  . This  transition  to  turbulence  "switched  on"  by  the  coherent  breakdown 
structure  provides  a further  incentive  for  its  study. 

Possibly  motivated  by  the  fact  that  the  flows  upstream  of  breakdown  can  be  made  to  have  a 
high  degree  of  axial  symmetry,  most  of  the  research  to  date  assumes  that  axially  symmetric 
processes  are  the  important  ones  in  vortex  breakdown.  As  only  axisymmetric  disturbances  can 
cause  a change  in  the  axial  velocity  component  as  measured  on  the  axis  and  as  a deceleration  of 
this  component  is  so  pronounced  in  breakdown,  it  is  clear  that  such  disturbances  play  an  impor- 
tant role.  Nevertheless,  all  transitions  occurring  in  vortex  flows  as  documented  by  Faler  [1976] 
are  nonaxisymmetric  and  the  flow  within  the  bubble  itself  is  unsteady  with  regular  low  frequency 
oscillations.  Furthermore,  the  stagnation  point  that  defines  the  start  of  the  recirculation  zone  is 
not  entirely  fixed  but  wanders  over  a short  range  of  the  axis  in  a seemingly  random  fashion. 

Leibovich  [1984]  proposed  the  following  plausible  scenario  for  the  bubble  type  breakdown. 
A finite  axisymmetric  disturbance,  triggered  off  downstream,  moves  upstream  in  a columnar  flow 
that  is  nearly  critical  in  the  sense  of  Benjamin  [1962].  (A  supercritical  flow,  in  this  classification, 
allows  for  the  upstream  propagation  of  infinitesimal  axisymmetric  waves  while  a subcritical  flow 
does  not).  Flows  of  the  form  (1.1,2)  can  indeed  support  axisymmetric  dispersive  waves  and  these 
can  propagate  upstream  in  some  situations  (Leibovich  [1970],  Randall  & Leibovich  [1973]).  Mov- 
ing in  this  direction,  the  cross  sectional  area  of  the  tube  decreases  causing  the  wave  to  amplify 
and  speed  up.  The  conjecture  is  that,  upon  reaching  some  critical  amplitude,  these  waves  lose 
stability  to  a non-axisymmetric  disturbance.  The  growth  of  the  asymmetric  mode  at  the  expense 
of  the  axisymmetric  wave,  drains  energy  from  it  and  this  causes  the  wave  to  equilibrate  at  some 
axial  location  in  the  diverging  tube,  much  as  is  seen  in  experiments. 
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Our  aim  is  to  study  the  stability  of  some  inviscid,  wavy  axisymmetric  flows  to  three  dimen- 
sional disturbances  with  the  amplitude  of  the  waves  viewed  as  a bifurcation  parameter.  We  start 
with  a columnar  flow  in  cylinderical  coordinates  of  the  form  (0,  Vj^r),  IV0(r)),  (e*8*  (1-1»2)).  (For 
arbitrary  C 1 functions,  VQ  and  W0,  all  such  flows  satisfy  Euler’s  equations).  We  then  seek 
axisymmetric  wavy  perturbations  to  this  flow  which  satisfy  the  equations  of  motion,  at  least 
approximately,  for  small  amplitude.  In  terms  of  a stream  function,  ip  and  a circulation,  k 
(Lamb  [1932])  Leibovich  [1972]  found  solutions  to  Eulers  equations  of  the  form, 


0(r,z) 

= ^0(r)  + «^(rM(*)  + °(e2)> 

(1.3) 

/c(r,z) 

= «0(r)  + «7(rM(*)»  + °(c2)> 

(1-4) 

where  x is  a moving  coordinate, 

x = z — dt 

(1.5) 

and  d is  a constant  that  must  found  in  the  calculation.  The  velocity  components 

are  given  by 

i a , 

« = -T-Vl 

r az 

(1.6) 

1 

V = K, 

r 

(1.7) 

i a . 

w = — • —rf). 

r or 

(1.8) 

The  columnar  base  flow  is  represented  by  ^0(r)  and  Ko(r)* 

The  amplitude  function,  A(z,t)  is  governed  by  a Korteweg  de  Vries  equation  which  has  both 
infinite  and  finite  period  solutions.  The  multiple  scales  analysis  that  was  used  to  obtain  these 
solutions  is  strictly  valid  only  for  long  period  waves  which  are  also  the  most  interesting  solutions 
from  a physical  point  of  view.  When  doing  the  stability  analysis  we  will  confine  our  attention, 
for  numerical  reasons,  to  solutions  of  the  finite  period,  2H  and  these  are  given  exactly  in  terms  of 
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cnoidal  functions  (Whitham  [1974]).  The  structure  functions,  ^6(r),  'l(r)  and  the  wave  speed  d 
are  determined  (numerically)  from  a second  order,  ordinary  differential  eigenvalue  problem. 

For  certain  base  columnar  profiles  d is  negative  and  consequently  the  axisymetric  wave  pro- 
pagates upstream.  Figure  2 is  a plot  of  the  streamlines  (1.3)  in  a frame  moving  with  the  wave  for 
such  a case.  The  base  columnar  profile  used  here  and  throughout  this  paper  is  a purely  swirling 
flow;  W0(r)  = 0 and  a = 14  in  the  notation  of  (l.l).  The  structure  function  $ has  been  normal- 
ized so  that  Max  <f>  = 1 and  for  this  flow  a recirculation  zone  (bubble)  first  appears  in  the  stream- 
line plot  when  the  amplitude  parameter,  e reaches  a value  of  0.0155.  For  the  value  of  e used  here 
the  plot  is  clearly  reminiscent  of  the  bubble  type  breakdown. 

Our  aim  is  to  study  the  stability  of  the  flows  (1.3,4)  to  three  dimensional  disturbances  view- 
ing e as  a bifurcation  parameter.  The  analysis  will  be  carried  out  in  a frame  moving  with  the 
wave,  i.e.  using  the  coordinates  (r,  x,  6).  As  the  base  flow  is  dependent  on  both  the  radial  and 
axial  variables,  r and  x,  the  stability  equations  separate  only  in  the  azimuthal  variable,  0.  It  will 
be  in  our  interest  also  to  study  the  stability  of  a related  columnar  flow  that  is  constructed  by  tak- 
ing the  two  dimensional  flow  (1.3,4)  at  a single  axial  station,  x = 0,  and  extending  it  throughout 
the  cylindrical  domain  without  variation.  For  obvious  reasons  we  will  refer  to  this  flow  as  the 
"mid-bubble"  columnar  flow  and  it  is  given  explicitly  as  follows, 


V,(r)  - 

W + “7<r)» 

(1.9) 

WM  - 

W'ofO  + 

(1.10) 

Plots  of  these  profiles  for  various  values  of  e are  given  in  Figures  3 and  4.  Provided  the  wavy 
flow  (1.3,4)  varies  only  slowly  along  the  axis  (as  it  will  do  if  the  period,  2 H of  A{x)  is  very  large), 
we  can  look  on  the  midbubble  profiles  as  models  for  the  full  two  dimensional  flow.  We  conjecture 
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that  the  stability  of  these  columnar  flows  (the  equations  for  which  separate  in  both  x and  0) 
should  also  be  indicative  of  the  stability  properties  of  the  full  two  dimensional  flow. 

In  the  rest  of  this  paper  we  describe  the  numerical  scheme  used  to  solve  the  stability  equa- 
tions, we  discuss  its  implementation  and  verification  on  the  computer  and  finally  we  give  results 
obtained  for  the  stability  of  the  midbubble  columnar  and  the  axisymmetric  wavy  flows  presented 
above. 


2.  Numerical  Methods. 

For  incompressible  fluids  the  physical  law  of  mass  conservation  reduces  to  the  constraint  that  the 
velocity  vector  of  the  fluid  be  divergence  free.  The  pressure  is  not  then  a thermodynamic  variable 
determined  by  an  equation  of  state  but  rather  can  be  thought  of  as  a Lagrange  multiplier  adjust- 
ing itself  instantaneously  to  ensure  that  this  kinematical  constraint  on  the  velocity  vector  is  met. 
There  is  no  evolution  equation  for  the  pressure  nor  does  it  satisfy  any  predetermined  boundary  or 
initial  conditions. 

Numericists,  seeking  to  solve  the  governing  equations  approximately,  have  found  that  their 
greatest  difficulty  lies  in  the  treatment  of  the  pressure  variable.  While  many  ingenious  methods 
have  been  devised  to  overcome  the  difficulties,  the  treatment  advocated  in  this  work  is  in  a 
mathematical  sense  the  most  natural  and  offers  many  computational  advantages.  Here,  the  pres- 
sure term  is  eliminated  from  the  equations  entirely  and  the  divergence  free  condition  is  satisfied 
exactly  by  the  numerically  obtained  approximation  to  the  velocity  vector.  Moreover,  as  the  com- 
ponents of  the  velocity  are  expanded  in  terms  of  series  of  polynomials  that  arise  as  the  solution  to 
a singular  Sturm  Liouville  problem,  whose  excellent  approximation  properties  are  well 
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documented  (e.g.  Gottleib  & Orszag  [1977],  Quarteroni  [1983])  convergence  of  our  approximation 
will  be  bound  only  by  the  smoothness  of  the  solution  and  by  the  number  of  terms  used  in  the 
component  expansions.  For  infinitely  differentiable  velocity  fields  we  should  expect  to  achieve 
"exponential  convergence"  (Canuto  et  al.) 

The  essence  of  the  method  (originally  due  to  Leonard  & Wray  [1982])  involves  expanding 
the  velocity  in  a series  of  divergence  free  vector  fields  each  of  which  satisfy  the  same  boundary 
conditions  as  the  velocity.  The  infinite  sums  are  truncated  and  substituted  into  the  governing 
equations,  which  are  the  Navier-Stokes  or  Euler  equations  linearized  about  the  appropriate  base 
flow.  Inner  products  are  taken  with  vectors  fields  which  satisfy  inviscid  boundary  conditions. 
This  eliminates  the  pressure  term  from  the  equations  and  reduces  the  differential  eigenvalue  prob- 
lem to  a matrix  eigenvalue  problem.  The  eigenvalues  determine  the  stability  of  the  base  flow  and 
the  eigenfunctions  are  the  set  of  coefficents  in  the  expansions  of  the  corresponding  perturbation 
velocity  fields. 

To  examine  how  this  method  works  we  recall  that  it  is  well  known  (Ladyshenskaya  [1966]) 
that  L2(D),  the  space  of  square  integrable  vector  functions  defined  on  a bounded  domain 
D (D  C Rn'  n = 2,3)  can  be  decomposed  into  those  that  are  divergence  free  and  whose  normal 
components  vanish  on  the  boundary  and  those  that  can  be  expressed  as  the  gradient  of  a 
differentiable  function  defined  on  D . For  this  paper  we  will  consider  vector  fields,  defined  on  the 
section  of  a cylinder  T,  which  are  periodic  in  both  the  axial  and  azimuthal  variables,  having  as 
their  axial  period  the  tube  length,  2H. 

We  will  decompose  L2(T)  as  follows. 

L\T ) = J{T)  + JL(T),  (2.1) 

where, 
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(a)  V'«  = 0 in  T, 

J(T)  = < «c!2(T)  (6)  u-n  = 0 on  dT, 

(<0  «|  sx  = «l  s,  • 


Si  and  S2  represent  the  ends  of  the  cylinder.  Given  in  this  form  J(T)  is  clearly  the  space  of 
(a)  incompressible,  (b)  inviscid,  (c)  periodic  velocity  fields. 

The  set  of  "viscous”  velocity  fields  on  T is  a subset  of  J(T)  denoted  J°(T). 


J°(T)  = \ ueJ(T)  | tt  = 0 on  8T 


An  alternative  representation  of  J(T)  (Richtmyer  [1978])  is  given  by, 


J(T)  = \ it  e L2(T) 


(а)  <u,  yp>  = 0 for  all  p e C°°(T) 

(б)  u\Si  = u\S2 


where  T is  the  closure  of  T and  <y>  represents  the  usual  inner  product  in  L2(T), 


<uyv>  = u-v  rdrdOdx  . (2*5) 

The  space  J(T)  endowed  with  this  inner  product  is  a Hilbert  space  and  a closed  subspace  of 
L2(T).  The  projection  of  L2(T)  onto  J(T)  will  be  denoted  by  n.  It  is  clear  that  vectors  of  the 
form  yp  are  perpendicular  to  all  u in  J(T)  and  in  fact  (Ladyshenskaya  [1966]), 


Jl(T)  = 4 u e L2(T)  | u = VP  for  some  p in  C^{T)  r . 


n then  has  the  following  properties, 


II  : L2{T)  J(T), 
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(2.8) 

(2.9) 


II  u = u for  all  u e J(T ), 

II  yp  = 0 for  all  p e Cl(T). 

To  determine  the  linear  stability  of  a flow  U to  say,  viscous  disturbances  which  are  periodic 
in  x and  9 we  consider  whether  infinitesimal  perturbations  to  U grow  in  time.  Therefore  we 
linearize  the  Navier-Stokes  equations  about  U and  seek  solutions  in  J°(T)  of  the  form, 

u(r,x,0)e-,v*.  (2.10) 


The  character  of  a then  determines  the  linear  temporal  stability  of  U. 

If  er  = a + ip  where  a,  /? 

are  real  then, 

£>0 

=>  U is  unstable, 

o 

II 

=>  U is  neutrally  stable,. 

(2.11) 

p<  0 

= > U is  stable 

The  equations  that  must  be  solved  have  the  form, 

t<ru  = Eu  + Re-15u.  (2.12) 

E and  S are  operators  defined  on  J(T)  as  follows, 

Su  = -n(yxw)  (2.13) 

and 

Eu  - II(a;xJ7  + flxu),  (2.14) 

where  H and  ware  respectively  the  base  and  perturbation  vorticities,  (fi  = yx  U,  lj  = yxu). 

Some  suitable  nondimensionalization  has  introduced  the  Reynolds  number, 

„ uoRo 

Re=-l-2.,  (2.15) 

Rq  and  C/q  being  characteristic  length  and  velocity  scales  associated  with  the  base  flow  and  v is 
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the  kinematic  viscosity  of  the  fluid.  We  can  take  R0  to  be  the  radius  of  the  tube  and  UQ  to  be  the 
maximum  value  of  the  columnar  base  flow  azimuthal  velocity,  V^r). 

The  VP  term  in  the  Navier-Stokes  equation  has  been  eliminated  by  projection  onto  J(T). 
Projection  of  the  stability  equation  onto  a finite  dimensional  subspace,  Jjy(T)  of  J(T)  is  achieved 
in  practice  by  taking  the  inner  product  of  the  equation  with  basis  vectors  for  JN(T).  This  pro- 
cess eliminates  the  operator  II  from  the  equation,  for  given  any  vector  / in  L2(T)  and  any  vector 
A in  J(T)  we  have  that, 


<11/,  A>  = </,  A>,  (2.16) 

as  projections  are  self  adjoint  and  as  the  projection  of  any  vector  in  J(T)  is  itself. 

It  is  worth  emphasising  that  even  when  we  are  solving  the  viscous  stability  equations,  we 
still  project  the  governing  equations  onto  the  space  of  inviscid  vector  fields.  The  reason  for  this  is 
that  having  found  a velocity  u such  that  the  vector  / defined  as, 

/ = icu  — wxf7  - fixu  + i/vx^  (2.17) 

is  orthogonal  to  all  A in  J(T)  then  / e J^(T)  and  so  there  exists  a scalar  function  p (a  pressure) 

with  / = VP*  If?  however,  / were  in  J°^(T),  which  contains  J^(T)  then  the  existence  of  a pres- 
sure is  not  guaranteed  and  consequently  u may  not  correspond  to  a physical  solution. 

Leonard  and  Wray  [1982]  demonstrated  a divergence  free  vector  function  expansion  for 
viscous  velocity  fields,  defined  on  a cylindrical  domain  that  are  Fourier  decomposable  in  both  the 
axial  and  azimuthal  variables.  We  will  construct  a somewhat  different  set  of  basis  vectors  here. 
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The  velocity  field,  u,  satisfies  the  continuity  equation  and  is  Fourier  decomposable  in  x and 
0,  which  means  in  effect  that  only  two  of  its  three  components,  u,  v,  w are  independent.  This 
motivates  the  introduction  of  two  vector  families,  in  an  expansion  of  the  form, 

* = Si  a2nJtmX^(r)  + °2n-lJtmX^(r)f  e^kz  + m0^  (2.18) 

nkm  \ J 

The  components  of  the  vectors  x * are  found  els  follows.  Expand  two  of  the  velocity  components, 
say  the  first  and  the  third,  independently  as, 


u 


nkm 


(2.19) 


nkm 


(2.20) 


where  /*(r)  are  complete  sets  of  polynomials  chosen  to  satisfy  the  boundary  conditions  that  are 
imposed  on  u,  tv.  The  r and  x components  of  have  now  been  picked  and  it  remains  for  us  to 
chose  the  0 components  in  a manner  that  ensures  the  vectors  e *(**  + are  divergence  free. 
Consider  for  example,  X^(r). 

S'(^T(r)  e‘(**  + mtf))  = °>  (2.21) 

= > 


(r/»(r))'  + '™Xn,e  = 0,  (2.22) 

where  the  prime  denotes  a derivative  with  respect  to  r.  This  equation  gives  us  the  0 component  of 
X^(r).  Rescaling,  it  is  found  that  an  expansion  of  the  form  (2.14)  is  possible  for  non-zero  azimu- 
thal wavenumbers  where, 

&»(r)  = ( *m/»(r)>  _ (r/n(r))/j  oj  , (2.23) 
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&t(r)  = ( 0,  - r*/„+(r),  rn/+(r)) 


and  such  an  expansion  will  guarantee  that  u is  divergence  free.  This  expansion  is  clearly  incom- 
plete for  azimuthal  wave  number  zero,  (m  = 0),  i.e.  for  axisymmetric  flows.  For  that  case  the 
following  expansion  vectors  can  be  used. 


Xn(r)  = ikfn(r)>  °>  “ — (r/„ (>•))'  , 


(2.25) 


2Gt(r)  = (o>  /»(»•)>  o) 


(2.26) 


The  polynomials  /*(r)  must  be  chosen  so  that  the  vector  u given  by  (2.14)  satisfies 
appropriate  (viscous  or  inviscid)  boundary  conditions  on  the  walls  of  the  domain,  T and  is  single 
valued  at  the  origin,  r = 0.  We  will  denote  the  polynomials  used  in  the  inviscid  case  by  a *(r) 
reserving  /±(r)  for  viscous  expansions.  We  have  then  upon  truncating  (2.14)  an  approximation 


to  u of  the  form, 


N K M 

UNKM  = E S E ankm  Pjikm{rixJ) 

n=l  k=—Km=—M 


(2.27) 


where, 


= ^(r»  m)  e ,(te  + m0) • (2.28) 

The  projection  vectors  will  have  the  same  form  as  the  expansion  vectors,  i.e.  we  will  project 
with  vectors,  A Jpf(r,x,0)  , where 


^IP1(r,x,e)  = ef(r;  k,  m)  e + «*> 


(2.29) 


l = l,...,jV;  p = q = — Af,...,AT 

with  the  vectors  (j  being  given  by  equations  (2.23  - 26)  using  the  inviscid  polynomials,  ( r ) for 
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the  components. 


It  is  possible  to  choose  the  polynomials  a/±(r)  and  /±(r)  in  many  different  ways.  Leonard  & 
Wray  [1982]  in  their  consideration  of  certain  turbulence  simulations  employed  an  unusual  set  of 
Jacobi  polynomials  to  reduce  the  bandwidth  of  the  final  matrix  system.  These  polynomials  were 
also  used  by  Spalart  [1983]  in  his  simulation  of  boundary-layer  transition.  Moser  & Moin  [1984] 
in  their  work  on  the  infinite  Taylor  Couette  system,  used  Tchebychev  polynomials  and  incor- 
porated the  weight  function,  against  which  these  polynomials  are  orthogonal,  into  the  projection 
vectors.  Here,  we  will  construct  the  basis  vectors  from  Legendre  polynomials.  All  of  the  above 
sets  are  solutions  to  singular  Sturm  Liouville  problems  and  consequently  we  can  expect  expan- 
sions in  terms  of  any  of  these  polynomials  to  exhibit  excellent  convergence  properties. 

The  single  valuedness  criterion,  which  must  be  applied  along  the  centre  line  of  the  tube  for 
the  vector  u,  causes  the  polynomials  /*(r)  and  a^(r)  to  depend  on  m,  the  azimuthal 
wavenumber  (Joseph  [1970]).  One  appropriate  choice  for  a±(r)  is, 

a/+(r)  = rP,(2r  - 1)  for  all  m, 

af(r)  = (1  - r)P,(2r  - 1)  if  | m|  =1,  (2.30) 

Oj“(r)  = r (1  - r)P,(2r  - 1)  if  I ml  * 

where  the  radial  variable  has  been  scaled  by  the  tube  radius  and  Pj(r)  is  the  Legendre  polynomial 
of  order  l (Abramowitz  & Stegun  [1970]).  The  corresponding  choice  for  /*(r)  is, 

fn(r)  = r(l  - r)P„(2r  - 1)  for  all 

/»(**)  = (1  - r)2Pn(2r  - 1)  if  | m|  = 1,  (2.31) 

/»(r)  = r (1  — r)2Pn(2r  — 1)  iflml^l- 
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As  all  of  our  stability  problems  separate  in  the  azimuthal  direction,  this  dependence  on  m 
presents  no  difficulty.  We  solve  separate  problems  for  each  azimuthal  wavenumber;  so  having 
chosen  an  m the  expansion  and  projection  sets  are  fixed  throughout  the  calculation  . Indeed,  in 
theory  there  is  no  difficulty  even  if  the  problem  at  hand  is  truly  three  dimensional;  however  some 
care  is  required  in  implementing  the  method  to  ensure  that  the  correct  radial  polynomial  set  is 
being  used  for  each  azimuthal  component  of  the  velocity. 


3.  Implementation  and  Verification  of  the  Method. 


Equation  (2.12)  is  solved  approximately  by  using  the  expansion  UjqxM  for  u and  taking  inner  pro- 
ducts of  the  equation  with  the  projection  vectors,  A ipq  to  get  a generalized  matrix  eigen  problem 
for  the  eigenvalues  a and  the  eigenvectors  a (the  coefficents  in  the  expansion  This  matrix 

problem  can  be  written  as, 


aAa 


a. 


(3.1) 


The  matrix  A is  purely  real  and  arises  from  the  fact  that  the  expansion  and  projection  vectors 
are  not  orthonormal. 


^ Inpkqm  <^Jpq>  Xji  — ^In^pk^qm^  (^*2) 

The  Kronecker  delta  symbol,  6|;*  arises  because  the  Fourier  bases  employed  in  the  axial  and 
azimuthal  directions  are  orthogonal.  The  matrix  B arising  from  the  convection  terms  is  also 
purely  real. 

& Inpkqm  “ ^^Jpqi  (Vx  ^jikm)  * “I"  0.*  (3*3) 

Finally,  the  matrix  C arising  from  the  viscous  terms  is  purely  imaginary. 
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Cl„k,m  = <^/„.VXVxCnim>. 


(3.4) 


Using  the  orthogonality  of  the  Fourier  bases  it  can  be  written  as, 


^inpkqm  Qn^pk^qm. 


(3.5) 


The  form  of  the  matrix  B depends  on  the  base  flow  U . For  columnar  flows  which  are  independent 
of  x and  0 it  is  possible  to  find  a matrix  B such  that, 


B Inpkqm  ^pk^qm. 


(3.6) 


The  stability  of  these  flows  can  be  determined  by  solving  the  O(N)  generalized  matrix  eigen 
problem, 


crAa  = 


b + -tc 


(3.7) 


On  the  other  hand,  for  the  axisymmetric  wavy  base  flow  (1.3,4)  we  have 


Inpkqm  ^Inpk^pk 


(3.8) 


and  the  stability  equation  is  the  (2 K + l)xivj  matrix  equation, 


crAa  = 


B + -tc 


(3.9) 


where  A and  C are  the  o|  (2 K + l)xivj  block  diagonal  matrices  and  clnspk  respectively. 

The  matrices  depend  parameterically  on  the  wavenumbers  of  the  projection  and  expansion  vec- 
tors so  we  separate  them  into  submatrices  that  can  be  evaluated  independently  of  these  and  the 
other  parameters  (in  particular  e)  occurring  in  the  base  flow.  The  submatrices  are  evaluated  once 
and  then  stored  in  the  computer.  The  required  integrations  can  be  done  at  very  little  cost  by  util- 
izing the  orthogonality  properties  of  the  expansion  and  projection  polynomials.  The  full  matrices 
are  then  be  reassembled  without  the  need  for  doing  any  further  integrations. 
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One  can  always  band  the  A and  C matrices  by  appropriate  choice  of  the  polynomials  /±(r) 
and  a*(r).  However  the  matrix  B will  generally  be  full,  though  for  certain  rather  simple  base 
flows  such  as  the  Hagen  Pouiseille  flow  considered  be  Leonard  & Wray  [1982]  it  is  also  possible  to 
band  B.  The  matrix  A was  inverted  to  produce  a regular  eigenvalue  problem  in  place  of  (3.1) 
and  the  QR  algorithm  was  used  to  extract  the  eigenvalues.  We  also  note  that  the  matrix  problem 
we  get  when  considering  the  inviscid  stability  of  base  flows  is  a purely  real  one  and  consequently 
the  eigenvalues  occur,  as  they  should  do,  in  conjugate  pairs. 

A computer  code  has  been  written  which  implements  the  method  we  have  been  describing  to 
solve  the  stability  problems,  both  viscous  and  inviscid,  for  all  columnar  flows  and  for  axisym- 
metric  flows  of  the  form  (1.3,4).  Both  the  direct  and  adjoint  versions  of  the  stability  problems 
can  be  solved.  The  adjoint  viscous  stability  problem  is  to  find  a u in  J°(T)  such  that 

»Au  = E*u  + (3.10) 

The  operator  E*  is  the  adjoint  operator  to  E and  is  given  by, 

E*u  = -n(  fixu  + vx(«x (3.11) 

The  direct  and  adjoint  spectra  obtained  by  solving  (2.12)  and  (3.10)  should,  of  course,  be  conju- 
gate to  each  other  and  how  well  a numerical  scheme  reproduces  this  theoretical  result  is  a test  of 
its  accuracy. 

We  verified  the  code  by  calculating  the  stability  of  rotating  Poiseuille  flow, 

U = ( 0,  Vxr,  1 - r2)  ) (3.12) 


Cotton  et  al.  [1980]  found  that  this  flow  with  Vj  = 0.2147  and  W\  = 1.0  was  neutrally  stable  to 
disturbances  having  azimuthal  wavenumber,  m = 1 and  axial  wavenumber,  k = — 1 for  a 
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Reynolds  number  of  156.  The  following  table  lists  the  most  unstable  eigenvalue  we  found  for  this 
flow  with  the  same  wavenumber  pair  for  the  disturbance.  The  first  column  of  the  table  gives  JV, 
the  number  of  radial  basis  vectors  that  were  used  to  obtain  the  eigenvalue  given  in  the  next  two 
columns,  N is  also  the  order  of  the  matrix  problem  that  needs  to  be  solved  at  each  step. 


Most  unstable  eigenvalue  found  for  the  rotating  Poiseuille  flow  (3.12) 
with  m = 1,  k = -1,  Vx  = 0.2147,  Re  = 156.0. 

N 

frequency 

growth  rate 

4 

-0.00029 

.00334 

6 

-0.00279 

.00101 

10 

-0.00284 

.000001 

14 

-0.002847 

.0000001 

18 

-0.002847898 

.0000001379 

22 

-0.002847898 

.0000001378 

The  convergence  is  exponential  in  N or  some  power  of  N and  there  is  no  evidence  of  significant 
roundoff  error.  The  following  table  lists  the  corresponding  eigenvalue  found  by  solving  the 
adjoint  viscous  stability  problem  for  the  same  wavenumber  pair  and  baseflow. 


Eigenvalue  found  by  doing  the  adjoint  viscous  stability  problem  for  the  flow  (3.12), 
with  m = 1,  k - - 1,  V1  = 0.2147,  Re  = 156.0. 

N 

frequency 

growth  rate 

4 

-0.00270 

-.000094 

6 

-0.00280 

-.000013 

10 

-0.00284 

-.000004 

14 

-0.002847 

-.0000002 

18 

-0.002847898 

-.0000001379 

22 

-0.002847898 

-.0000001379 

Clearly  the  agreement  between  the  adjoint  and  direct  results  is  excellent.  Inviscid  stability 
results  for  flows  of  the  form  (3.12),  obtained  using  our  code  also  compare  well  with  results  in  the 
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literature.  These  results  instill  confidence  in  the  accuracy  of  the  numerical  method  and  in  the 
code  that  implements  it,  at  least  for  the  case  of  columnar  flows. 

4.  Stability  Results  for  the  Vortex  Breakdown  Model  Flows. 

In  this  section  we  will  present  the  results  obtained  to  date  for  the  stability  of  the  midbubble 
columnar,  (1.9,10)  and  the  wavy  vortex  (1.3,4)  flows.  Although  these  flows  are  inviscid  we  will 
consider  their  stability  to  both  viscous  and  inviscid  disturbances  (i.e.  we  will  solve  the  linearized 
Euler  and  the  linearized  Navier-Stokes  equations  for  these  flows).  The  justification  for  doing  a 
viscous  analysis  is  that  the  "real11  flow  is  of  course,  viscous.  Moreover,  the  inclusion  of  the  higher 
order  dissipative  terms  eliminates  certain  technical  difficulties  that  arise  due  to  the  existence  of 
critical  layers  in  the  neutrally  stable  eigenfunctions  for  columnar  flows  (Drazin  & Reid  [1981]). 

We  will  begin  by  presenting  the  viscous  results  for  the  midbubble  columnar  flows.  We  found 
that  thirty  radial  vector  modes  ( N = 30)  were  adequate  to  resolve  the  most  unstable  eigenmode 
(i.e.  the  mode  whose  eigenvalue  had  the  largest  imaginary  part)  to  three  decimal  places  for  these 
flows  at  low  Reynolds  numbers  and  that  this  number  increased  as  the  Reynolds  number  grew. 
Frequent  checks  were  carried  out  on  the  accuracy  of  the  computed  eigenvalues  both  by  increasing 
the  order  of  the  expansion  and  also  by  computing  the  adjoint  spectrum  for  the  same  set  of  flow 
parameters.  The  difference  between  the  most  unstable  eigenvalue  as  computed  by  the  direct  and 
adjoint  versions  of  the  code  was  always  less  than  1%. 

Having  fixed  the  number  of  radial  expansion  vectors  in  our  system  the  viscous  eigenvalues 
for  the  midbubble  flows  depend  on  four  parameters, 

<x  = a(m,*,e,iZe).  (4.1) 
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The  base  columnar  flow  (e  = 0)  was  found  to  be  stable  to  all  disturbances.  It  seems  that  even  for 


very  small  values  of  e (values  for  which  there  is  no  recirculation  zone  in  the  full  two  dimensional 
flow)  the  midbubble  flows  are  unstable.  This  is  documented  in  the  following  table  which  gives 
bracketing  values  for  the  critical  Reynolds  number  for  various  values  of  e. 


Bracketing  values  for  the  critical  Reynolds  number. 
Various  values  of  e and  m = — 1. 

€ 

Stable  for  Re 

Unstable  for  Re 

Stable  for  all  Re 

550 

600 

180 

200 

0.015 

160 

180 

110 

120 

60 

80 

K51 

42 

45 

For  large  enough  values  of  e disturbances  having  both  positive  and  negative  azimuthal 
wavenumbers  can  destabilize  the  midbubble  flows  with  the  negative  azimuthal  modes  giving  rise 
in  general  to  the  largest  values  for  the  growth  rates.  In  particular  disturbances  with  azimuthal 
wavernumber,  m = — 1 were  found  to  be  the  most  dangerous.  This  is  shown  in  the  following 
table. 
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Bracketing  values  for  the  critical  Reynolds  number. 
Various  values  of  m with  e = .03. 

m 

Stable  for  Re 

Unstable  for  Re 

-i 

42 

45 

-2 

90 

100 

-3 

700 

800 

-4 

1200 

1400 

-5 

1400 

1600 

For  fixed  values  of  m and  e,  a two  parameter  (A:,  Re)  study  was  carried  out.  With  e = .03 
and  m = -1  we  obtain  the  stability  diagram  shown  in  Figure  5.  The  stability  boundary  appears 
to  be  a parabolic  curve  which  is  markedly  asymmetric  with  respect  to  the  k = 0 line.  Within  this 
curve  the  base  flow  is  unstable  to  a range  of  axial  wavenumbers;  however,  there  is  a "tongue"  of 
stable  wavenumbers  that  gradually  thins  out  as  the  Reynolds  number  is  increased.  The  k = 1 
mode  is  the  final  one  to  be  excited,  this  does  not  happen  until  Re  = 4500  (approximately). 

We  now  consider  the  stability  of  the  midbubble  flows  to  inviscid  disturbances.  The  invis- 
cisid  stability  of  columnar  flows  is  governed  by  an  ordinary  differential  equation,  the  Howard- 
Gupta  [1962]  equation.  A number  of  analytic  results  obtained  from  this  equation  exist  in  the 
literature.  Leibovich  & Stewartson  [1982]  showed  that  a sufficent  condition  for  the  instability  of  a 
columnar  flow  is  that  the  function, 

Hr)  = F(r)A'(r)(  A'(r)r(r)  + (4.2) 

/ 

be  negative  somewhere  in  the  domain  of  interest.  (A  is  the  angular  velocity,  — V and  T is  the  cir- 

r 

culation  rV .) 
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Th  function  i^(r)  is  easily  evaluated  for  the  midbubble  profiles  and  this  has  been  done  for  a 
range  of  values  of  e.  It  was  found  that  F first  becomes  negative  only  when  a critical  value  of  e is 
reached,  e = 0.132.  Consequently  the  midbubble  flows  are  guaranteed  to  be  unstable  by  the 
Leibovich-Stewartson  criterion  for  values  of  the  parameter  e slightly  below  those  needed  to  pro- 
duce a recirculation  region  in  the  full  two  dimensional  flow  (recall  this  happens  for  e = 0.155). 

A normal  mode  stability  analysis  was  carried  using  the  divergence  free  expansion  method 
and  the  numerically  obtained  results  confirm  and  somewhat  extend  the  predictions  of  the 
Leibovich-Stewartson  criterion.  Once  again  we  found  that  the  disturbances  giving  rise  to  the 
largest  growth  rates  had  azimuthal  wavenumbers,  | m|  =1.  Figure  6 shows  how  the  maximum 
growth  rate  varies  (almost  linearly)  with  the  amplitude  parameter,  e.  In  the  figure  we  can  see 
that  the  normal  mode  analysis  extends  the  previously  obtained  results  in  that  midbubble  flows 
with  e < 0.0132  are  also  found  to  be  unstable,  even  though  F(r)  > 0 for  these  flows.  The  max- 
imum growth  rates  also  increase  with  the  size  of  the  axial  wavenumber,  | A:|  , as  shown  in  Figure 
7. 


We  conclude  that  the  midbubble  flows  are  definitely  unstable  on  both  viscous  and  inviscid 
grounds  for  values  of  the  parameter,  e below  those  needed  to  produce  a reversed  flow  region  in  the 
full  two  dimensional  wavy  flow.  Moreover  the  most  destabilizing  disturbances  have  azimuthal 
wavenumbers,  | m\  = 1 and  short  axial  period,  (|  £|  >1).  The  unstable  inviscid  eigenfunctions 
tended  to  have  regions  of  steep  gradient  near  the  origin  and  this  made  their  resolution  difficult. 

While  these  midbubble  stability  results  we  have  just  reported  tend  to  support  the  conjec- 
tures made  about  vortex  breakdown,  they  are  not  encouraging  for  the  numericist  seeking  to  inves- 
tigate the  stability  of  the  cnoidal  wave  flows  (1.3,4).  As  we  noted  earlier,  the  stability  equation 
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for  these  flows  do  not  separate  in  x and  consequently  we  have  to  solve  0(iV(2/if+l))  matrix 
eigenvalue  problems  for  each  flow.  It  would  seem  to  be  necessary  to  include  modes  having  a short 
axial  period  in  our  expansion,  u NKM  which  means  that  K will  be  large.  The  inclusion  of  these 
modes  is  also  dictated  by  physical  considerations.  We  should  like  the  disturbance  to  include 
modes  that  scale  with  the  dimensions  of  the  bubble  and  in  fact  visualization  studies  indicate  that 
the  asymmetric  unstable  modes  do  have  short  axial  periods.  However,  our  experience  with  the 
midbubble  flows  show  that  these  unstable  modes  are  difficult  to  resolve  radially,  consequently  the 
number  of  radial  vector  functions,  N in  our  expansion  must  also  be  large. 

With  these  constraints  the  normal  mode  analysis  of  the  cnoidal  wave  flows  becomes  prohibi- 
tively expensive,  (reacall  that  the  number  of  operations  needed  to  extract  all  the  eigenvalues  of  a 
matrix  is  proportional  to  the  cube  of  its  order).  To  alleviate  the  cost  problems  most  of  the  runs 
for  these  flows  were  done  with  the  axial  period  of  the  cnoidal  wave  fixed  at  1 [H  = 0.5,  units  are 
tube  radii).  While  such  short  period  flows  violate  the  assumptions  needed  to  produce  the  solu- 
tions (1.3,4)  it  was  hoped  that  these  flows  would  be  unstable  to  disturbances  with  smaller  values 
for  K.  Indeed  convergence  studies  indicate  that  adequate  resolution  in  the  axial  direction  was 
obtained  with  K « 10.  Up  to  40  radial  modes  were  used  in  the  study,  leading  to  an  0(840) 
matrix  eigenvalue  problem  when  K = 10.  (The  calculations  were  performed  on  an  Floating  Point 
Systems  164  series  vector  processor  at  Cornell  University,  using  a vectorized  version  of  the  QR 
algorithm,  optimized  for  this  machine  and  using  some  16  megabytes  of  memory.)  The  adjoint 
problem  was  also  solved  on  each  run  and  only  those  eigenvalues  which  agreed  well  between  the 
adjoint  and  direct  calculations  were  considered. 


Unfortunately  the  short  period  flows  behave  less  like  the  midbubble  flows  and  more  like  the 
underlying,  stable  columnar  flows.  This  is  indicated  in  Figure  8 which  plots  the  least  stable 
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growth  rates  found  for  the  various  values  of  e as  the  Reynolds  number  is  increased  (m  = —1  in 
this  plot).  All  these  short  period  cnoidal  wave  flows  are  stable,  though  marginally  so.  The  dis- 
turbances with  | m|  = 1 are  again  the  most  unstable.  Figure  9 shows  the  least  stable  growth  rates 
found  for  some  other  azimuthal  wavenumbers.  The  inviscid  stability  runs  that  were  performed 
also  failed  to  turn  up  any  definite  evidence  of  instability  for  these  flows. 

We  have  considered  the  viscous  and  inviscid  stability  of  some  axisymmetric  flows  which  are 
said  to  model  the  bubble  type  of  vortex  breakdown.  The  investigation  was  carried  out  by 
expanding  the  perturbation  velocity  in  terms  of  the  new  set  of  divergence  free  vectors  presented 
in  section  2.  The  stability  results  for  the  midbubble  flows  support  the  conjectured  mechanism  for 
breakdown  and  it  was  found  that  the  most  dangerous  disturbances  have  a short  axial  period  and 
azimuthal  wavenumbers,  | m|  =1.  The  two  dimensional  flows  we  considered  all  had  rather  short 
axial  periods  and  these  do  not  model  the  physical  phenomena  well.  No  conclusions  as  to  the  sta- 
bility of  these  flows  can  be  drawn  because  the  normal  mode  analysis  used  here  can  only  prove  ins- 
tability (by  finding  a growing  disturbance).  No  evidence  of  instability  was  found  but  this  may 
well  be  because  we  failed  to  include  enough  axial  modes  in  our  expansion  for  the  disturbance. 
The  study  clearly  points  out  that  linear  stability  investigations  for  complex  base  flows  are  far 
from  trivial  from  a computational  point  of  view. 
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Figure  1.  Axisymmetric  bubble  type  vortex  breakdown  (after  Leibovich  [1978]). 
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Figure  3 


Axial  velocity  profiles  for  the  midbubble  flow, 
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Figure  4.  Azimuthal  velocities  for  the  midbuble  flows. 
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Figure  6.  Maximum  growth  rates  plotted  against  e,  m = — 1 and  k = 6. 
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Figure  7. 


Maximum  growth  rates  plotted  against  axial  wavenumber,  (inviscid  analysis  for 
the  midbubble  flow),  m = — 1. 
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Figure  8.  Least  stable  growth  rates  found  for  the 
H = .5,  m = —1. 
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Abstract 

The  incompressible  axisymmetric  steady  Navier-Stokes  equations  and  the 
Euler  equations  are  solved  numerically  to  model  the  breakdown  of  a vortex. 
The  solutions  obtained  for  the  Euler  equations  show  a "vortex  breakdown-like" 
structure,  their  behavior  is  very  different  from  that  of  the  Navier-Stokes 
solution  which  are  obtained  at  low  Reynolds  number.  The  details  of  the 
numerical  algorithms  used  are  presented,  and  the  results  obtained  are  compared 
to  those  in  the  literature  at  the  same  Reynolds  number. 
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1.  INTRODUCTION 


Under  certain  conditions,  it  has  been  observed  that  the  vortex  shed  from 
the  highly  swept  leading  edges  of  a delta  wing  can  change  its  structure 
abruptly.  The  change  is  characterized  by  either  a spiral  deformation  of  the 
vortex  axis  or  the  formation  of  a stagnation  point  along  the  vortex  axis 
followed  by  a bubble  of  recirculating  flow.  Downstream  of  this  structural 
change,  the  flow  appears  to  be  highly  sensitive  to  perturbations  and  is 
usually  turbulent.  This  sudden  change  in  structure  is  known  as  vortex  break- 
down. The  effect  of  vortex  breakdown  on  the  aerodynamics  of  a wing  is  very 
important,  since  it  degrades  the  performance  of  the  wing  and  can  set  a limit 
on  the  maximum  attitude  achievable  by  the  wing.  The  phenomenon  has  been 
studied,  both  experimentally  and  theoretically,  for  the  last  30  years,  but  no 
really  satisfactory  theory  exists  to  explain  it.  The  reader  is  referred  to 
the  two  reviews  of  the  subject  given  by  Leibovich  [1,2]. 

Our  interest  in  vortex  breakdown  was  aroused  by  the  claim  of  Hitzel  and 
Schmidt  [3]  that  vortex  breakdown  could  be  predicted  on  the  basis  of  the  Euler 
equations.  We  consider  that  the  numerical  studies  of  flow  over  a delta  wing 
by  Hitzel  and  Schmidt  are  too  superficial  to  warrant  such  a conclusion.  The 
flow  over  a delta  wing  at  high  angles  of  attack  is  too  complex  and  requires 
too  many  computational  resources  to  allow  an  indepth  study.  How  could  the 
problem  be  formulated  such  that  it  would  lend  itself  to  an  investigation  of 
the  relevance  of  the  Euler  equations  vis-a-vis  the  Navier-Stokes  equations?  A 
drastic  simplification  of  the  problem  is.  required.  Fortunately,  experi- 
mentalists have  already  achieved  this  by  studying  vortex  breakdown  within  the 
confines  of  cylindrical  tubes.  In  addition,  two  numerical  investigations  of 
the  Navier-Stokes  equations  have  been  presented  [4,5]  for  this  problem;  in  one 
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case  [4]  steady  solutions  were  obtained,  while  in  the  other  [5]  the  solutions 
appeared  unsteady.  Perhaps  an  additional  investigation  could  shed  some  light 
onto  this  problem.  The  purpose  of  this  work  is,  therefore,  to  investigate  the 
possibility  of  simulating  vortex  breakdown  with  the  Euler  equations,  studying 
the  relation  of  these  solutions  to  those  of  the  Navier-Stokes  equations  and 
comparing  the  latter  to  those  in  Refs.  4 and  5. 


2.  MATHEMATICAL  FLOW  MODELS 

An  incompressible  steady  axisymmetric  flow  with  swirl  can  be  described  in 
terms  of  a streamfunction,  i|);  azimuthal  vorticity  component,  w;  and  a 
circulation,  k.  In  cylindrical  coordinates  (x,r,0)  the  Navier-Stokes 
equations  are : 


r ^)r  + *xx  = r“ 

(U“)r  + <™>x  + (4)x  - ir  («rr  + (£)  + “xx) 

r c r 

UKr  + ^x  = (Krr  " 7 Kr  + Kxx)  (2.1) 

where  k = rv,  u>  = wr  - ux,  and  Rg  is  the  Reynolds  number  defined  in  terms 
of  the  free-stream  axial  velocity,  the  vortex  core  radius,  and  the  kinematic 
viscosity  of  the  flow.  The  velocity  components  in  the  x,r,0  directions  are 
denoted  by  w,  u,  and  v,  respectively.  In  terms  of  the  streamfunction,  w 
and  u are  given  by: 
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u = 


(2.2) 


The  inviscid  equations  are  obtained  from  Eqs.  (2.1)  by  letting  Rg  -»■  «. 
In  the  inviscid  limit,  it  is  clear  that  the  circulation  becomes  constant  along 
a streamline.  It  can  also  be  shown  that  the  total  enthalpy,  h,  becomes 

constant  along  streamlines.  Moreover,  the  vorticity  component  oj  can  be 

related  to  the  gradients  of  the  circulation  and  the  total  enthalpy  by: 


dh  _ 1 d(ic2) 
d\{j  2 


(2.3) 


where  the  total  enthalpy  is  given  by: 

h = p + -j  (u^  + v^  + w^) 


(2.4) 


and  p is  the  pressure.  Notice  that  in  the  absence  of  swirl  (v  = 0),  rn/r 
becomes  constant  along  a streamline.  We  also  notice  that  the  contribution  to 
the  vorticity  (given  by  eqn.  (2.3))  due  to  circulation  term  does  not  depend 
on  the  sign  of  < but  only  on  its  magnitude.  The  functions  <(\J)(x,r))  and 
h(t(x,r))  are  determined  in  terms  of  the  specified  inflow  profile  <(^(o,r)) 
and  h(iJ>(o,r))  at  the  upstream  boundary,  provided  that  i|>(x,r)  is  positive 
(i.e.,  outside  a recirculation  bubble).  Inside  the  bubble,  the  k and  h 
distributions  are  not  known.  In  fact,  within  the  inviscid  model  a 
discontinuity  is  admissible  across  the  streamline  forming  the  bubble  (the 
separating  streamline).  One  way  to  avoid  this  problem  is  to  invoke  analytic 
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continuation  of  the  functions  k(<J>)  and  h(ij/)  for  negative  ip.  In  the 

present  work,  the  dependence  of  k and  h is  known  analytically  for  positive 
4*  from  the  assumed  initial  profiles.  The  same  functional  dependence  is 
assumed  for  negative  \ p.  As  a side  point,  it  should  be  mentioned  that  since 
k vanishes  along  a separating  streamline  (k  = 0 on  the  axis)  and  k is 
analytically  continued  inside  the  bubble,  it  is  reasonable  to  assume  that  k 
changes  sign  inside  the  bubble;  and,  as  a consequence,  the  swirl  velocity  in 
the  bubble  has  the  opposite  sense  to  the  swirl  in  the  main  flow.  This  is  not 
the  behavior  observed  with  the  viscous  problem  at  Re  = 100  and  200. 

In  solving  the  Navier-Stokes  equations,  the  viscous  terms  play  an 
important  role  in  the  neighborhood  of  the  separating  streamline  by  preventing 
the  formation  of  discontinuous  solutions.  A similar  role  is  played  by  the 
artificial  viscosity  terms  in  Euler  calculations  based  on  primitive  variables 
(i.e.,  velocity  components).  However,  it  may  be  argued  that  although  the 

artificial  dissipation  is  critical  in  singling  out  a solution,  the  solution 
may  be  independent  of  Its  form  and  magnitude.  In  the  least  square  formulation 
used  in  this  work,  there  is  no  explicit  or  implicit  artificial  dissipation. 
In  fact,  the  truncation  errors  for  the  central  differences  to  be  used  are  of  a 
dispersive  nature.  It  is,  however,  the  assumption  of  analytic  continuation 
of  k and  h,  for  the  inviscid  problem,  that  rules  out  any  discontinuities. 

If  we  let  the  vortex  core  radius  at  the  upstream  boundary  be  r = 1 and 
the  radius  at  the  farfield  boundary  be  r = R,  the  inflow  profiles  at  x = 0 
are  given  by: 
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u(r)  = 0 

v(r)  = Vr(2  - r2) 
v(r)  = V/r 
w(r)  = 1 


0 < r < R 
r < 1 


r > 1 
0 < r < R 


(2.5) 


where  V is  the  maximum  circumferential  swirl  velocity  at  the  edge  of  the 
vortex  core.  These  profiles  are  the  same  as  those  used  in  Ref.  4.  From  these 
profiles,  it  follows  that  the  circulation  at  the  upstream  boundary  is  given 
by: 

k2  = 16V2t|)2  (1  - i|02  < 1/2 


k2  = V2 


i>  > 1/2 


(2.6) 


and  that  the  vorticity  component  is  given  by: 


ra) 


= 16V2  (1  + 2i|>2  - 3<|0(t-  - <|>) 


< 1/2 


(2.7) 


ra  = 0 


* > 1/2 


In  terms  of  a perturbation  streamfunction  ¥ = - \{j,  the  equation  governing 

the  inviscid  flow  is: 

'i'xx  + r <Vr)r  = -^V2ct2'P  (2.8) 

where 
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a2  = 4 (1  + 2<J i2  - 3<|>) 


i>  < 1/2 


2 


a 


- 0 


i>  > 1/2 


(2.9) 


We  notice  that  since  ¥ vanishes  at  the  axis  and  if  we  require  ¥ to 
vanish  in  the  farfield,  then  the  trivial  solution  H1  = 0,  corresponding  to 
cylindrical  stream  surfaces,  is  a solution  of  the  above  equation.  We  are, 
however,  interested  in  nontrivial  solutions. 

Using  standard  central  difference  approximation,  equation  (2.8)  leads  to 
a nonpositive  definite  matrix  which  is  difficult  to  solve  by  standard 
relaxation  schemes.  To  avoid  this  problem,  a least-square  variational 
formulation  is  obtained  for  the  function 

F(Y,u',w')  = /Jn^£  - w')2  r + + u')2  r + (w'  - u'  + 4V2  f)2W 

(2.10) 


where  0 extends  over  the  domain  of  interest  and  u and  w are  the 
perturbation  velocities  in  terms  of  ¥.  The  first  term  in  parenthesis  in  the 
kernel  of  Eq.  (2.10)  corresponds  to  the  definition  of  w' , the  second  term 
corresponds  to  the  definition  of  u' , and  the  last  term  corresponds  to  Eq. 
(2.8).  Each  of  these  terms  should  vanish  in  the  steady  state.  To  form  the 
kernel  of  Eq.  (2.8),  each  of  the  above  terms  is  multiplied  by  an  arbitrary, 
but  positive,  weight  function.  The  choice  of  r as  the  weight  function  for 
the  first  two  terms  and  the  cube  of  unit  length  for  the  last  term  was  made  to 
simplify  the  form  of  the  resulting  equations.  From  the  function  (2.10),  the 
following  Euler  equations  are  easily  obtained  [6]: 
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u — ru  = ¥ + w +2 

XX  iU  Tx  rx 


(2 


w - rw  = - Y + u - g 
rr  r xr  6r 


where 


,„2  2 

4V  a 

g = — - — V 


3.  NUMERICAL  FORMULATION 

3.1  Invlscid  Problem 

Equations  (2.11)  are  solved  using  a staggered  grid  for  V,  u , and 
With  Y defined  at  i,j  nodes,  u'  between  nodes  of  horizontal  lines, 
w'  between  nodes  of  vertical  lines,  the  resulting  discrete  equations  are: 
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tWi,j+3/2  2wi , j+1/2  + wi,J-l/2) 
*_2 


ri,j+l/2Wi, j+1/2 


w - 

i,j+l 

Ar 


+ (ui+l/2 . j+1  ui-l/2.i+l  ui+l/2.i  + ui-l/2.j) 

AxSr 


(si,j+i  gi,j) 
Ar 


where 


(i-1)  Ax  i = 1,2. 

(j-l)  Ar  j = 1,2. 


.1 

max 

(3.2) 

.J 

max 


The  first  equation  of  (3.1)  is  solved  for  'F  by  the  Zebra  vertical  line  over- 
relaxation algorithm;  the  second  equation  is  solved  for  u by  direct 

inversion  of  horizontal  lines;  the  last  equation  is  solved  for  w by  direct 
inversion  of  vertical  lines.  For  the  first  equation  of  (3.1),  the  boundary 

condition  consists  of  'F  = 0 all  around  the  domain.  For  the  second  equation, 

boundary  conditions  are  required  at  i = 1+1/2  and  i = I - (1+1/2). 

max 

* 

Boundary  conditions  for  u at  i = 1+1/2  are  obtained  by  solving 


[ux  - (wr  + g)]  - [*x  + ru  ] = 0 


(3.3) 


at  i = 2;  while  at  i = Imax  - (1+1/2),  boundary  conditions  are  obtained  by 
solving 


[ux  - (wr  + g)]  + [^x  + ru  ] = 0 


(3.4) 
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at  1 = Xmax  “ 1*  Each  term  in  square  brackets  appearing  in  Eqs.  (3.3)  and 
(3.4)  vanishes  in  the  steady  state.  The  change  in  sign  between  Eqs.  (3.3)  and 
(3.4)  is  introduced  to  add  to  the  diagonal  dominance  of  the  discrete 
equations.  Similarly,  for  w the  equation 


1 

w - (u  - 
- r x 

g)]  + [Yr  - rw']  = 0 

(3.5) 

is  solved  at 

i - 2 to 

obtain  w 

at  j = 1+1/2;  while  at 

j ~ ^max  ~ 1 > the 

equation 

i 

i 

\ u 

Cq 

1 

ME- 

1 

£ 

II 

O 

(3.6) 

is  solved  to 

obtain  w 

at  j = Jmax  - 0+1/2). 

3.2  Viscous 

Problem 

^he  first  equation  of  (2.1)  is  discretized  using  second-order- accurate 
central— difference  formulas.  To  the  second  and  third  equations  of  (2.1)  the 
time  terms  wt  and  are  added  to  the  left-hand  side  of  each  equation, 

respectively.  The  convection  terms  of  these  two  equations  are  discretized 
using  upwind  first-order  accurate  formulas,  while  the  diffusion  terms  are 
discretized  using  second-order-accurate  formulas.  The  time  terras  are 
discretized  using  first-order  backward  derivatives.  Unlike  the  inviscid 
problem,  a nonstaggered  grid  is  used  for  the  viscous  problem.  The  discrete 
equations  are 
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Ar2  2Ar  ri,j  Ax2 


(3.7) 


Quantities  with  a bar  are  taken  at  the  new  time  or  iteration  level.  The  time 

step  is  chosen  equal  to  Ax.  (Note  that  the  identity  u + w = — was  used 

r x r 

to  simplify  the  convective  terms  of  the  second  equation  above.)  The  Eqs. 
(3.7)  are  solved  by  vertical  line  over-relaxation  with  the  following  boundary 
conditions: 
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At  x = 0, 


k = 4Vip(l  - i|>)  0 < r < 1 

k = V 1 < r < R 


03  = ip  /r  ; 
xx  ’ 


at  r = 0, 

il>  = 0 

K = 0 
(d  = 0 ; 


k = V 
a)  = 0 ; 

at  the  outflow,  x = L, 

ip  = 0 
rx 

Kx  = 0 

03  =0 

X 

To  improve  the  convergence  rate  of  the  viscous  problem,  the  acceleration 
method  described  in  Ref.  7 was  used. 
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4.  DISCUSSION  OF  RESULTS 


In  order  to  measure  the  deviation  of  a solution  from  the  trivial 
2 

£ 

solution  tj;  = , we  define  the  norm  of  the  perturbation  strearafunction  as 


I J 
max  max 


= LRl  I l ^ 

\i=i  j=i  1J 


(i  j > 

max  max 


(4.1) 


Tests  were  performed  to  determine  the  required  number  of  mesh  points  and  the 
required  locations  of  the  farfield  boundaries  to  achieve  a certain  level  of 
accuracy.  Two  of  these  tests  are  illustrated  in  Figures  1 and  2 for  the 
inviscid  problem.  Figure  1 shows  the  asymptotic  behavior  of  the 

streamfunction  norm  as  the  number  of  mesh  points  in  the  axial  direction  is 
increased,  holding  all  other  parameters  fixed.  Figure  2 shows  the  effect  of 
the  location  of  the  outflow  boundary  on  the  norm.  From  this  study,  it  was 
concluded  that  for  the  inviscid  problem  a minimum  spacing  Ar  = Ax  = 1/16 

was  required  and  that  R > 2,  L > 4 was  also  required.  The  same 

requirements  were  found  for  the  viscous  problem  for  100  < Re  < 200,  except 
that  the  location  of  the  outflow  boundary  had  to  be  increased  to  L > 10. 
Figures  3 and  4 show  that  the  same  solution  is  obtained  for  the  inviscid 
problem  with  L = 5 and  L = 10.  For  all  cases  presented,  residuals  were 
driven  to  machine  zero,  0(10”^). 

A summary  of  the  results  is  given  in  Figure  5,  This  figure  shows  the 
norm  defined  by  Eq.  (4.1)  as  a function  of  the  square  of  the  swirl  parameter 
V.  Two  nontrivial  branches  were  found  for  the  inviscid  problem.  The  first 

branch,  indicated  in  the  figure  by  the  closed  circles,  corresponds  to 

axisymmetric  vortex  breakdown-like  solutions.  Figs.  (3),  (4),  and  (6) 

illustrate  the  streamline  topologies  found  in  this  branch.  The  same  branch 
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was  found  by  Ta'asan  [8]  using  a multigrid  algorithm  to  solve  Eqs.  (2.8).  As 

shown  in  Figure  5,  our  results  and  those  of  Ta'asan  are  in  good  agreement.  The 

problem  with  this  branch  is  that  as  the  swirl  parameter  is  increased,  the  size 

of  the  bubble  decreases.  This  behavior  contradicts  the  experimental 

observations.  The  second  inviscid  branch,  indicated  by  the  open  circles, 

o 

intersects  the  first  at  approximately  V = 0.575.  For  values  of  V near  the 

intersection  of  the  two  branches,  the  numerical  algorithm  developed  a limit 

cycle  where  a single  bubble  splits  in  two.  The  two  bubbles  later  coalesce  and 

the  cycle  is  repeated.  The  limit  cycle  prevented  convergence  to  a steady 

state.  Ta'asan  only  encountered  the  first  branch  and  was  able  to  continue 

this  branch  down  to  the  axis,  as  shown  in  Figure  5.  The  streamlines 

corresponding  to  the  second  branch  are  illustrated  in  Figure  7.  Obviously, 

this  -branch  is  not  of  the  vortex  breakdown  type.  It  is  believed  that  the 

second  branch,  although  a solution  to  the  least  square  problem,  is  not  a 

solution  of  the  original  inviscid  problem  (Eqs.  2.1  with  Re  «).  The 

evidence  for  this  comes  from  inserting  the  least-square  solutions  into  the 

original  equations  and  evaluating  the  residuals.  When  this  is  done  for  the 

2 

first  branch,  residuals  of  the  order  of  Ax  are  found.  For  the  second 

branch,  the  residuals  are  of  order  Ax,  and  remain  at  the  same  level  when  the 
mesh  is  refined. 

For  the  viscous  problem,  results  are  presented  in  Figure  5 for  Reynolds 
numbers  100  and  200.  The  axial  velocities  obtained  here  and  those  obtained  by 
Grabowski  and  Berger  [4]  are  compared  in  Figure  8.  The  agreement  is  good  when 
we  consider  that  Grabowski  and  Berger  used  a much  coarser  but  highly  stretched 
mesh,  slightly  different  outflow  boundary  conditions,  and  did  not  converge 
their  solutions  to  the  same  level  as  in  this  work.  It  also  appears  that  the 
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results  obtained  by  Krause  et  al.  [5]  are  anomalous,  since  they  were  unable  to 
obtain  steady-state  solutions  for  the  same  cases  studied  here.  Their  failure 
to  reach  a steady  state  could  be  a result  of  the  outflow  boundary  being  at 
L = 5,  too  close  to  the  inflow  boundary*  In  our  work,  we  found  this  location 
for  the  outflow  boundary  to  lead  to  a large  open  bubble  (see  Figure  9),  but 
the  solution  was  steady  nonetheless. 

Figure  10  illustrates  the  changes  in  the  bubble  structure  as  the  swirl 

parameter  is  increased  with  Re  - 100  held  fixed.  The  same  is  illustrated  in 

Figure  11  for  Re  = 200.  It  is  interesting  to  see  the  very  rapid  change  in 

2 

the  norm  that  occurs  at  Re  = 200  and  V * 1.27.  (See  Figure  5.)  This 
behavior  opens  some  questions  about  possible  hysteresis  and  bifurcation  at 
higher  Reynolds  number.  However,  our  present  approach  is  not  capable  of 
handling  much  higher  Reynolds  numbers  well;  and,  therefore,  these  questions 
will  be  considered  at  a later  time.  Figure  12  shows  the  minimum  value  of  the 
axial  velocity  component  on  the  axis  as  a function  of  V for  Reynolds  numbers 
of  100  and  200.  The  point  at  which  the  recirculation  bubble  first  appears 
corresponds  to  the  first  intersection  of  these  curves  with  w = 0.  The  second 
intersection  corresponds  to  the  point  at  which  the  recirculation  bubble  lifts 
off  the  axis. 


5.  CONCLUDING  REMARKS 

Numerical  solutions  of  the  Euler  equations  were  obtained  and  a vortex 
breakdown-like  topology  was  observed.  Those  solutions  were  in  good  agreement 
with  those  obtained  by  Ta^asan  [8].  For  the  Navier-Stokes  equations, 
solutions  were  also  obtained  with  vortex  breakdown-like  topology.  These 
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latter  solutions  were  in  good  agreement  with  the  results  reported  in  Ref.  4. 
The  behavior  of  the  inviscid  solutions  with  increasing  swirl  was  not 
consistent  with  the  behavior  of  the  Navier-Stokes  solutions  at  low  Reynolds 
number  nor  with  experimental  observations.  (Experimental  results  showing 
bubble-type  vortex  breakdown  are  usually  obtained  at  higher  Reynolds  numbers.) 
A future  study  will  investigate  the  high  Reynolds  number  limit  of  the  Navier- 
Stokes  equations  and  compare  it  to  the  Euler  solutions  obtained  here. 
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Figure  1.  Convergence  of  the  norm  of  the  inviscid  streamf unction  4'  with 
increasing  resolution  in  the  axial  direction,  holding  L = 5, 
R = 2,  V2  - 0.4,  and  4r  - 1/16  fixed. 
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Figure  2.  Effect  of  increasing  the  length  of  the  domain  on  the  norm  of  the 
inviscid  streamfunction  V , holding  R = 2,  Ar  = Ax  = 1/16. 
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Figure  3.  Computed  streamline  pattern  for  = 0.2,  L = 5,  R = 2,  and  Ax  = Ar  = 1/16. 


Figure  4.  Computed  streamline  pattern  for  - 0.2,  L - 10,  R - 2,  and  ax  ~ Ar  ~ 1/16. 
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Figure  5.  Norm  of  the  streamf unction  as  a function  of  V^. 


284 


X 


Figure  8,  Comparison  f velocity  on  vortex  axis  between  present  results 
(solid  line)  and  those  of  Ref.  4 (dashed  line). 


L 


Figure  9.  Computed  streamline  pattern  for  Re  = 200,  V = 0*8944,  L = 5, 

R » 2,  and  Ax  « Ar  = 1/16.  The  shortness  of  the  domain  results 
in  a large  open  bubble.  This  case  corresponds  to  the  same 
conditions  of  Figure  4.14,  Ref.  9. 
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(c)  V = 1.2 


(e)  V = 1.5 


Figure  10.  Computed  streamline  patterns  for  Re  = 100,  L = 10,  R = 2, 

Ax  = Ar  = 1/16,  and  increasing  values  of  V.  Details  of  the 

bubble  structure  are  shown  on  the  insets. 
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(d)  V = 1.15 


Figure  11.  Computed  streamline  patterns  for  Re  = 200,  L = 10,  R = 2, 

Ax  = Ar  = 1/16,  and  increasing  values  of  V.  Details  of  the 
bubble  structure  are  shown  on  the  insets. 
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MULTIGRID  METHOD  FOR  A VORTEX  BREAKDOWN  SIMULATION 


Shlomo  Ta'asan 

Institute  for  Computer  Applications  in  Science  and  Engineering 


ABSTRACT 

In  this  paper  we  study  an  inviscid  model  for  a steady  axisymmetric  flow 
with  swirl.  The  governing  equation  is  a nonlinear  elliptic  equation  which  has 
more  than  one  solution  for  a certain  range  of  the  swirl  parameter.  The 
physically  interesting  solutions  have  closed  streamlines  that  look  like  vortex 
breakdown  ("bubble"-like  solutions).  A multigrid  method  is  used  to  find  these 
solutions.  Using  an  FMG  algorithm  (nested  iteration),  the  problem  Is  solved 
in  just  a few  multigrid  cycles. 
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1 . INTRODUCTION 


In  this  paper  we  study  an  inviscid  model  for  steady  axisymmetric  flow  with 
swirl,  which  has  solutions  with  closed  streamlines.  These  solutions  have  a 
structure  similar  to  that  observed  experimentally  as  "bubble"-like  solutions 
when  vortex  breakdown  occurs  [4], 

Using  a streamfunction-vorticity  formulation  to  the  axisymmetric 
incompressible  Navier-Stokes  equations,  it  was  found  [3]  that  one  can  reduce 
the  problem  to  a single  nonlinear  elliptic  equation  for  the  streamfunction , in 
case  of  a special  inflow  flow  and  some  regularity  assumption  on  the  vorticity. 
This  nonlinear  elliptic  equation  for  the  streamfunction  has  more  than  one 
solution.  The  trivial,  represents  a uniform  flow  and  is  of  no  physical 
interest.  The  other  shows  a "bubble"— like  structure,  the  target  of  our 
numerical  study. 

In  solving  the  problem  numerically,  the  problem  is  reformulated  in  terms 
of  a perturbed  streamfunction,  i.e.,  the  deviation  from  the  trivial  solution. 
In  terras  of  this  perturbed  streamfunction,  the  trivial  solution  is  represented 
as  an  identically  zero  solution.  Our  goal  then  is  to  find  non— zero  solutions 
which  have  "bubble"-like  form. 

The  approach  we  have  taken  in  finding  these  solutions  is  to  seek  first  for 
a bifurcation  point  from  the  trivial  branch  of  solutions.  By  introducing  a 
continuation  parameter,  we  can  then  start  marching  on  a branch  of  non-trivial 
solutions  that  bifurcate  from  that  point.  One  choice  of  a continuation 
parameter  is  arc  length  [1].  Another  choice,  which  is  simpler  but  may  not  be 
good  in  general,  is  the  norm  of  the  perturbed  streamfunction.  The  natural 
parameter  in  the  problem,  a swirl  velocity  parameter,  is  not  good  enough  since 
it  cannot  "choose"  the  non-zero  branch  as  can  the  former  parameter.  We 
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therefore  choose  the  norm  as  a continuation  parameter,  making  the  swirl 
velocity  parameter  an  unknown  to  be  determined  by  the  solution. 

The  multigrid  approach  used  for  solving  the  problem  is  similar  to  the  one 
used  in  [5]  for  solving  the  Bratu  problem.  The  relaxation  in  this  method 
consists  of  three  steps:  (i)  a local  relaxation  to  smooth  the  error;  (ii)  a 
step  to  update  the  norm  of  the  solution;  and  (iii)  a step  to  update  the  swirl 
velocity  parameter.  An  FMG  algorithm  (nested  iteration)  is  used.  That  is,  a 
solution  for  the  prescribed  norm  is  found  first  on  the  coarsest  level,  and 
then  interpolated  to  finer  levels,  where  on  each  level  a few  basic  multigrid 
V-cycles  are  performed  before  proceeding  to  yet  finer  level. 

The  coarsest  level,  when  solved  to  get  an  initial  approximation  for  finer 
levels,  uses  a continuation  method.  Here  the  problem  was  solved  first  for  a 
small  norm,  and  then  the  norm  is  gradually  increased  until  the  prescribed  norm 
is  reached.  Each  time  the  norm  is  increased,  the  solution  of  the  previous 
step  was  used  as  initial  approximation.  By  solving  for  a bifurcation  point 
from  the  trivial  solution,  a first  approximation  for  the  smallest  norm  problem 
was  obtained. 

Once  a solution  on  the  coarsest  level  is  obtained  for  a prescribed  norm, 
it  is  possible  to  solve  finer  grid  problems  without  continuation. 

The  same  problem  we  are  discussing  here  was  treated  by  a completely 
different  method  and  is  reported  in  [3].  There,  a single  grid  method  was  used 
with  a least  squares  formulation  of  the  problem.  The  amount  of  work  needed 
for  that  approach  is  considerably  larger  than  the  one  reported  here.  Computed 
solutions  by  the  two  different  formulations  are  in  good  agreement. 
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2.  ON  DERIVATION  OF  THE  GOVERNING  EQUATION 

We  summarize  here  the  derivation  of  the  equations  used  in  the  numerical 
process  as  given  in  [3].  In  cylindrical  coordinates  (x,r,0)  the 
incompressible  Navier-Stokes  equations  can  be  written  in  terms  of  a stream- 
function  ip,  vorticity  to,  and  circulation  k as 


*r 

r ~r  + +xx  = ™ (2*la) 


(uu)r  + (wu)x  + 


0) 

rr 


(2.1b) 


uk  + wk 
r x 


rr 


+ k 


xx 


(2.1c) 


where  k = rv,  to  = w^  - ux  and  Re  is  the  Reynolds  number.  The  velocity 
components  in  the  x,r,0  directions  are  w,  u,  v,  respectively,  of  which  w 
and  u are  given  in  terms  of  the  streamfunction  by 


w 


(2.2a) 


u = 


(2.2b) 


It  is  shown  in  [3]  that  in  the  inviscid  case  (Re  = ~),  one  finds  that  the 
circulation  k and  the  vorticity  to  are  functions  of  the  streamfunction  ip 
only.  Therefore,  k and  to  can  be  determined  outside  the  "bubble"  from  the 
inflow  boundary  condition.  In  the  model  discussed  it  is  assumed  that  the  same 
functional  dependence  of  k,  to  on  ip  is  true  also  inside  the  bubble 
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(negative  i|>).  This  imposes  some  regularity  on  the  solution 


For  the  inflow  conditions 


v(0,r)  = 


(vo  r(2  ■ r 

(Vr 


2 


) 


r < 1 
r > 1, 


(2.3a) 


w(0,r)  = 1, 


(2.3b) 


it  is  possible  to  write  k and  w in  terms  of  the  streamfunction  as 


16  V2  i|»2(l 


- rpy 


k (0,r)  = 


< V2 

t > V2 


(2.4a) 


'16  V2(  1 + 2rp2  - 3i^)(r2/2  - *) 


u(0,r)  = 


< V2 
^ > V2 


(2.4b) 


and  therefore,  the  equation  obtained  for  is 


xrCi{Jr/r-) r + \|»xx  = - 4V2  a2 ( xp) ( ip  - r2/2) 


(2.5a) 


where 


«2(^) 


!4(1  + 2 «{/2  - 3ip) 

0 


f < V2 

i|>  > V2  • 


(25b) 


The  reduction  of  the  governing  equations  to  a single  nonlinear  elliptic 
equation  is  possible  if  the  relation  ij)  = f(r)  in  the  inflow  boundary  can  be 
inverted  to  get  r = g(>J>).  When  g(^)  is  introduced  in  the  expression  for 
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v at  the  inflow  boundary,  one  has  v as  a function  of  \p  in  that  boundary 

and  therefore  u(ij>).  Note  that,  in  general,  one  cannot  expect  to 

analytically  invert  the  relation  = f(r),  and  so  the  reduction  of  the 

governing  equations  is  possible  only  for  very  special  inflows. 

2 

Numerical  experiments  were  done  in  terms  of  <|>  = xp  — •£—  , which  is  a 

2 2 

perturbation  from  the  trivial  solution  i|>  = j-  that  represents  a uniform  flow. 


3.  NUMERICAL  ALGORITHM 
3.1.  Discretization 

o 

The  equation  for  <J>  = ij>  - r /2  is  given  by 


rG?  <l>r)r  + <l>xx  + 4 V2  ct2(«J>)4>  = 0,  Si  = (0,a)  x (0,b)  (3.1a) 


<J>  = 0,  on 


(3.1b) 


where 


ct2(<J>) 


'4  $ - 1 + — (2<(>  - 1 + r2) 


<t>  + < V2 


otherwise 


(3.1c) 


Equations  (3.1)  are  discretized  as 


1+1,  J 


2<b^  + 4>h  r 

9lj  91-1,.1  , j 

h2  7 


2 r .h 

r4  ^1»  j+1 


rj  + l + ‘j 


♦«> 


+ r^^ij  *i,j-l) 


+ V2  a2(<|)Jj)((.Jj  « 0,  in  Slh  (3.2a) 


= 0,  on  80h  (3.2b) 
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where  ft*1  = {nh,mh),  0 < nh<  a,  0<mh<b}. 


3.2.  General  Strategy  for  Solving  the  Discretized  Equations 

Equation  (3.2)  has  the  trivial  solution  <j>^  = 0 for  any  Vq.  This 

solution  corresponds  to  a uniform  flow  and  is  not  interesting  physically.  We 

h 2 

seek  solutions  which  represent  vortex  breakdown  so  that  \\$  \\  * 0,  where 

ll<j>hll2  = h2  E <(>2j.  (3.3) 

Iterating  on  equation  (3.2)  by  any  iterative  method  may  lead  us  to  the  trivial 
solution.  In  order  to  rule  out  this  possibility,  we  specify  the  norm  of  the 
discrete  solution  we  want  to  find,  while  making  free  the  swirl  velocity 
parameter  Vq. 

To  summarize,  we  solve  equation  (3.2)  for  (4>^,  Vq)  under  the 

constraint 

»<t>hN2  = gQ,  (3.4) 

where  gQ  is  given. 

i. 

A relaxation  scheme  for  (<)>  ,Vq)  in  equation  (3.2)  together  with  the 
constraint  (3.4)  is  described  next. 


3.3.  Relaxation 

Equations  (3.2),  (3.4)  form  a nonlinear  system  of  equations  for  (^^jVq). 
The  relaxation  used  for  this  system  has  three  steps:  (i)  a local  process  for 
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smoothing  if)*1  in  equation  (3.2);  (ii)  a global  change  to  satisfy  (3.4);  and 
(iii)  updating  the  swirl  parameter  Vq.  That  is,  one  relaxation  consists  of 
doing  (i),  (ii),  and  (iii)  successively. 

(i)  local  relaxation 

h 

Scan  the  point  (i,j)  € ft  in  lexicographic  ordering;  at  each  point 
(i,j)  solve  (3.2)  approximately  for  by  applying  one  Newton 

iteration. 

(ii)  global  step 

Compute  3 = / g0/ll<j>hll2  . 

Then  make  the  change 


*hn  * 6 ♦«  • 


(i,j)  e an. 


(iii)  updating  Vq 

Change  Vq  such  that  the  following  equation  holds 


<Lh  <f>h  + 4V2  a2 ( <J>h ) <f>h , <f>h>  = <fh,<f>h> 


(3.5) 


h h 1 

where  L (j>  is  the  discretization  of  L<j>  = r(—  <J>  ) + <|)  ,<•,*> 

r r r xx 

2 Vi 

denotes  the  inner  product,  <u,v>  *=  h T u . . v . . , and  fh  is  the 

^ ij  iJ 

t_ 

right-hand  side  of  equation  (3,2).  (In  a multigrid  process  fn  is 

nonzero  on  coarse  grids.) 


We  now  come  to  the  description  of  the  multigrid  algorithm  used  to  solve  (3.2), 
(3.4)  for  (<|>h,  VQ). 


298 


3.4.1.  Basic  Cycle: 

Given  a sequence  of  discretizations  with  mesh  sizes 
hr  > 1*2  >•••>  hm,  where  h^  = 2hk+j.  The  h^-grid  equation  is  generally 
written  as 

Lk  4>k  = fk  (3.6) 

where  Lk  approximates  Lk+1  (k  < m)  (e.g.,  they  all  are  finite-difference 

approximations  to  the  same  differential  operator).  The  algorithm  for 

'v'k. 

improving  a given  approximate  solution  <f>  to  (3.6)  is  denoted  by 

?k  + MG(k,  ?k,  fk)  (3.7) 

and  is  defined  recursively  as  follows: 

If  k = 1,  solve  (3.6)  by  several  relaxation  sweeps;  qtherwise  do  steps 

(A)  - (D): 

(A)  Perform  relaxation  sweeps  on  (3.6),  resulting  in  a new 

approximation  <j>  . 

(B)  Starting  with  <|>k  ^ = Ik  ^ perform  one  cycle 

|k_1  4 MG(k-l,  ^k_1,  Lk_1  $k_1  + l£-1(fk  - Lk  $*)). 

(C)  Calculate 

rk  _ -rk  k r~k-l  k-1  -rk'1 

4>  - ♦ + lk_jU  - \ 4>  J. 

(D)  Perform  V2  additional  relaxation  sweeps  on  (3.6)  starting  with 
|k  and  yielding  the  final  ?k  of  (3.7). 
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are  fine-to-coarse  grid  transfer  operators; 


In  this  algorithm  Ik~*,  * 

Ik-1  is  an  interpolation  operator.  We  refer  to  the  above  cycle  as  MG(v1,v2). 
In  the  notation  of  this  section  (3.6)  includes  both  equations  (3.2)  and  (3.4). 

The  basic  cycle  described  above  is  for  improving  a given  approximation  on 
level  k.  The  full  multigrid  (FMG)  process  involves  solving  the  problem  on 
the  coarsest  grid,  interpolating  it  to  finer  grids,  and  making  the  cycle 
MG(v^,v2)  a few  times  after  each  refinement. 


3.4.2.  Full  Multigrid  Algorithm  (FMG) 

1.  Solve  (3.6)  for  k = 1,  using  a continuation  method  (see  remark 
below) . 

2.  Set  k = k + 1 and 

~k  k ~k- 1 k 

<J>  = ^k-1  ^ » where  is  a bicubic  interpolation. 

3.  Perform  y(k)  times  the  cycle 

«■  MG(k,  £k,  fk). 

4.  If  k < m,  go  to  step  2;  otherwise  stop. 

A Remark  on  Step  1 of  the  FMG  Algorithm  (Continuation  Method) 

Since  the  problem  involved  is  a nonlinear  one,  and  we  are  using  a Newton 
iteration,  a good  initial  approximation  may  be  needed  to  get  fast  convergence 
for  k = 1 (the  coarsest  grid).  This  has  been  achieved  by  using  a continuation 

1_  o 

process  where  we  solve  first  for  a small  norm  ll<j>  II,  then  gradually 
increasing  it  until  the  prescribed  norm  is  obtained.  Each  time  the  norm  is 
increased,  the  solution  of  the  previous  step  is  used  as  an  initial 
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approximation.  In  order  to  get  a good  initial  approximation  for  the  smallest- 
norm  problem,  we  have  solved  for  the  bifurcation  point  from  the  trivial  branch 
of  solutions. 


3.5  Solving  for  the  Bifurcation  Point 

& & 

At  a bifurcation  point  (<J>  , V^),  the  linearized  problem  of  (3.1)  must 
have  a zero  eigenvalue,  and  the  corresponding  eigenfunction  gives  rise  to  a 
second  branch  of  solutions.  Since  <f>  = 0 is  a solution  for  any  Vq,  we  may 
try  to  find  a bifurcating  branch  from  the  trivial  one  (0,Vq).  The 
linearized  equations  around  (0,Vq)  are  given  by 

W + r(iw  ) + 4V2  a2(0)W  =0,  in  ft  (3.8a) 

xx^rr'rO 

W = 0,  on  9ft.  (3.8b) 

If  there  exists  a bifurcating  branch  from  the  trivial  one  (0,Vq),  equation 

^ ^ A 

(3.8)  has  a solution  (W  ,Vq)  with  II W \\^  = 1 where  II  denotes  the 

norm. 

We  discretize  (3*8)  in  a way  similar  to  the  discretization  of  (3.1).  The 
constraint 

IIWhll2  = 1, 

is  added  to  ensure  a non-zero  solution  to  the  problem.  The  process  of  solving 
the  eigenvalue  problem  is  identical  to  the  process  of  solving  (3.2),  (3.4). 
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= ±eW  as 


Once  this  linear  eigenvalue  problem  is  solved,  we  can  use  ^ 

an  initial  approximation  for  our  original  problem  with  a prescribed  norm  of 

e.  The  sign  is  chosen  such  that  <f>n  has  negative  values,  to  ensure  that  the 

2 U 

total  streamf unction  + <j>  will  have  closed  streamlines  with  negative 

values  (the  bubble). 


4.  NUMERICAL  RESULTS 

Experiments  were  performed  with  equations  (3.2),  (3.3)  using  FMG 
algorithm  of  Section  3.4.2.  In  these  experiments  the  domain  was 

« {(nh,  Zh),  0 < nh  < 5,  0 < £h  < 2} . 

Three  levels  were  used  in  the  multigrid  algorithm  where  the  finest  grid 
problem  has  mesh  size  1/16.  On  the  coarsest  level  20  relaxations  were 
performed  while  on  finer  grids  vx  = v2  = 3,  -y(k)  = 4.  In  all  numerical 
experiments  1^  = 1^  is  injection,  1^  ^ is  bilinear  interpolation,  and 

is  bicubic  interpolation. 

Tables  I-IX  contain  the  L?-norm  of  the  residuals  and  the  values  of  V? 

c 0 

at  the  end  of  each  cycle  on  the  finest  grid.  Cycle  #0  refers  to  the 
approximation  obtained  from  the  previous  level  as  an  initial  guess.  Figures 
1-9  show  the  streamlines  (contours  of  i|>)  for  the  different  cases.  The  value 
of  VQ,  the  swirl  parameter  value  for  which  bifurcation  occurs  is  VQ  = 1.0069 
(computed  on  coarsest  level). 

The  experiments  clearly  show  that  the  multigrid  method  suggested  is  very 
efficient.  In  fact,  as  seen  by  the  convergence  history  for  Vq  , it  is  enough 
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to  take  y(k)  = 2,  Instead  of  y(k)  = 4,  i.e.,  by  2 FMG  cycles  the  problem  is 
already  solved. 

The  results  show  that  bigger  bubbles  are  obtained  for  smaller  swirl 
parameters,  contradicting  to  what  one  would  expect.  This  may  be  the  result  of 
the  assumption  made  in  the  model,  that  the  same  functional  dependence  of 
k,  m on  i|)  holds  inside  as  well  as  outside  the  bubble.  A future  study  will 
investigate  this  point  by  solving  the  full  systems  (2.1),  making  no  extra 
assumptions. 
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Table  I.  Il(|>hll2  = .005 


cycle  # 

II  Residuals  II 2 

V2 

0 

0 

.362  (-1) 

.95088 

1 

.986  (-3) 

.96069 

2 

.843  (-4) 

.96039 

3 

.148  (-4) 

.96041 

4 

.745  (-5) 

.96042 

Table  III. 

ll<t>hll2  = .11 

cycle  // 

II  Residuals  II 2 

V2 

vo 

0 

.122 

.59214 

1 

.233  (-2) 

.54739 

2 

.215  (-3) 

.54732 

3 

.615  (-4) 

.54733 

4 

.542  (-4) 

.54733 

Table  V. 

n<i>hii2  = .2 

cycle  // 

II  Residuals^ 

V2 

0 

0 

.150 

.42902 

1 

.242  (-2) 

.43301 

2 

.193  (-3) 

.43294 

3 

.366  (-4) 

.43294 

4 

.266  (-4) 

.43294 

Table  II.  Il<j>hl|2  = .05 


cycle  # 

II  Residuals  II 2 

0 

.948  (-1) 

.68322 

1 

.232  (-2) 

.68962 

2 

.251  (-3) 

.68939 

3 

.113  (-3) 

.68941 

4 

.918  (-4) 

.68941 

Table  IV. 

ll<J>hll2  = .15 

cycle  // 

II  Residuals  ll£ 

V2 

0 

0 

.135 

.48347 

1 

.243  (-2) 

.48803 

2 

.168  (-3) 

.48798 

3 

.474  (-4) 

.48798 

4 

.425  (-4) 

.48798 

Table 

VI. 

ll<f>hll2  = .4 

cycle 

# 

II  Residuals  II 2 

V2 

0 

0 

• .192 

.30435 

1 

.271  (-2) 

.30725 

2 

.239  (-3) 

.30719 

3 

.177  (-3) 

.30719 

4 

.176  (-3) 

.30719 
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Table  VII. 

2 - .6 

v2 

0 

0 

.230 

.24006 

1 

.303 

(-2) 

.24335 

2 

.218 

(-3) 

.24231 

3 

.188 

(-3) 

.24231 

4 

.175 

(-3) 

.24231 

Table  IX. 

2.0 

cycle  // 

II  Residuals  II 2 

V2 

0 

0 

.428 

.10176 

1 

.701  (- 

•2) 

.10276 

2 

.777  (- 

•3) 

.10275 

3 

.584  (- 

•3) 

.10275 

4 

.574  (- 

■3) 

.10275 

Table  VIII.  !l<|>hll2  = 1.0 


< 

O N3 

0 

.295 

.17139 

1 

.385  (-2) 

.17303 

2 

.363  (-3) 

.17302 

3 

.294  (-3) 

.17302 

4 

.278  (-3) 

.17302 
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CONSTRUCTION  OF 

HIGHER  ORDER  ACCURATE  VORTEX  AND  PARTICLE  METHODS 


R.  A.  Nicolaides 
Carnegie-Mellon  University 


ABSTRACT 

The  standard  point  vortex  method  has  recently  been  shown  to  be  of  high 
order  of  accuracy  for  problems  on  the  whole  plane,  when  using  a uniform 
initial  subdivision  for  assigning  the  vorticity  to  the  points.  If  obstacles 
are  present  in  the  flow,  this  high  order  deteriorates  to  first  or  second- 
order.  This  paper  introduces  new  vortex  methods  which  are  of  arbitrary 
accuracy  (under  regularity  assumptions)  regardless  of  the  presence  of  bodies 
and  the  uniformity  of  the  initial  subdivision. 


This  work  was  supported  by  the  Air  Force  Office  of  Scientific  Research  under 
Grant  AF0SR-84-0137. 


1.  INTRODUCTION 

There  has  been  a growing  interest  recently  in  the  theory  and  application 
of  point  vortex  methods  to  the  numerical  solution  of  the  incompressible  Euler 
and  Navier-Stokes  equations.  The  impetus  for  the  Euler  case  stems  from  the 
basic  work  of  Dushane  [6],  Hald  and  Del  Prete  [7],  and  Hald  [8],  the  Fourier 
analysis  of  Beale  and  Majda  [1],  [2],  [3],  and  the  Sobolev  space  approach  of 
Ravlart  [12]  and  Cottet  [4],  A recent  paper  by  Cottet  and  Gallic  [5]  extends 
the  latter  approach  to  linear  Burger's  type  equations  with  "viscosity" 
accounted  for  by  splitting  the  convection  and  viscous  parts  and  using  a 
Green's  function  for  the  viscous  computation.  A method  for  introducing 
viscosity  into  particle  methods  for  compressible  flows  is  given  by  Monaghan 
and  Gingold  [9].  See  also  [10]  and  [11].  Apart  from  the  first  three  of  these 
references,  the  authors  all  obtain  high  order  of  accuracy  error  estimates, 
limited  mainly  by  the  regularity  of  the  exact  solution  of  the  continuous 
equations.  Unfortunately,  the  possibility  of  obtaining  this  accuracy  is 
dependent  on  the  existence  of  expansions  similar  in  nature  to  the  Euler- 

MacLaurin  sum  formula.  If,  for  any  reason,  it  is  not  possible  to  assert  the 

existence  of  such  expansions,  the  accuracy  drops  to  first-  or  second-order, 
depending  on  the  exact  details  of  the  algorithm  and  which  errors  are  being 
estimated.  If  general  boundaries  (bodies)  are  present  in  the  flow  field,  or 
if  the  initial  subdivision  of  the  flow  field  is  not  uniform,  the  necessary 
expansions  will  most  likely  cease  to  exist.  Then  questions  arise  as  to  how 
higher-order  schemes  may  be  constructed,  and  more  important  whether  it  is 
worthwhile  to  use  them  in  view  of  the  extra  expense  which  is  involved.  The 

purpose  of  the  paper  Is  to  give  some  possible  answers  to  these  questions. 


313 


In  Section  2,  the  basic  equations  are  given,  and  the  simplest  particle 
method  is  defined  for  comparison  with  some  higher-order  schemes.  These 
schemes  are  introduced  in  Section  3.  There,  three  methods  for  generating 
schemes  of  arbitrary  accuracy  are  provided.  An  appendix  contains  some 
technical  results  about  solving  scalar  hyperbolic  equations  with 
distributional  data. 

This  paper  is  of  an  algorithmic  nature  and  does  not  contain  numerical 
results  or  precise  error  estimates.  These  will  appear  elsewhere. 


2.  MODEL  PROBLEM 

The  incompressible  Euler  Equations  in  vorticity-velocity  form  are 

u>t  + (uco)x  + (vu))y  = 0 'j  (2.1) 

( in  IF? 

div(u,v)  = 0 : curl(u,v)  = w j (2.2) 

with  initial  condition 

w(x,y,0)  = u)0(x,y).  (2.3) 

The  basic  ideas  for  constructing  higher-order  schemes  will  be  shown  for  (2.1) 
and  (2.3),  with  (u,v)  assumed  given.  For  these  linear  problems  it  is  not 
necessary  to  assume  that  (u,v)  is  solenoidal. 

In  this  setting,  we  will  now  define  the  basic  particle  (or  point  vortex) 
method.  Subdivide  the  plane  into  squares  of  side  h,  number  the  squares 
1,  2,  3,...  in  some  convenient  way  and  define  a distributional  approximation 
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(2.4) 


to  u)Q(x,y)  by 


o)oh(x,y) 


- I 

i 


h (d(x. 


6(x-x. 


y-Yi) 


where  (x^,y^)  denotes  the  center  of  the  i*"*1  mesh  square,  and  6(x-x^,y-y^) 
denotes  the  Dirac  delta  function  with  pole  at  (x^,yi).  Now  solve  (2.1)  and 
(2.2)  with  u>g(x,y)  a)Qji(x,y).  The  well  known  solution  to  the  latter 

problem  is  the  distribution 


oh(x,y,t)  E I h “(x^y^  <5(x-  X(xi,yi;t),  y - Y(x1,yi;t))  (2.5) 


where  X(x£,y^,t)  denotes  the  solution  of  the  characteristic  equation 


dX/dt  = u(X,Y , t)  X(0)  = x± 
and  correspondingly  for  Y. 

No  use  is  made  of  the  uniformity  of  the  mesh  in  deriving  (2.5).  For  a 
nonuniform  mesh,  h in  (2.5)  is  the  area  of  the  appropriate  mesh  square.  In 
the  error  formulas  below,  h denotes  the  largest  mesh  length. 

It  is  immediately  clear  from  this  definition  that  the  particle 
approximation  is  non— dissipative , in  the  sense  that  no  artificial  viscosity  is 
introduced  because  after  the  discretization  of  the  initial  condition  is  made 
(2.1)  is  solved  exactly.  In  practice  some  ODE  solver  must  be  used  to  compute 
the  trajectories,  but  in  theory  its  error  can  be  made  arbitrarily  small.  This 
principle,  of  solving  the  exact  equation  with  approxirate  data,  seems  to  be 
common  to  particle  methods  generally  and  distinguishes  them  from  finite 
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difference  and  finite  element  methods.  The  latter,  at  least,  solves  an 
approximate  equation  with  exact  data, 

A rigorous  error  analysis  of  the  method  just  defined  can  be  found  in 
[12],  This  analysis  is  too  complicated  to  reproduce  here.  Nevertheless,  we* 
need  some  simple  guide  to  compare  the  accuracy  of  various  schemes.  It  seems 
reasonable  to  look  at  the  difference  against  a test  function  as  a 

measure  of  11  truncation  error”  since  it  is  the  only  error  made.  Thus  we 

define,  for  a given  method  of  approximation  and  a given  function  u)q  with 
compact  support  Q (where  area  (fl)  = 1 say) 

Th(<f>)  = //  (“o  “ “oh^dxdy*  (2.6) 

Here,  the  integration  is  performed  over  H?.  The  restriction  that  ui^  has 
compact  support  is  a matter  of  convenience  rather  than  necessity  and  could  be 
replaced  by  sufficiently  rapid  decay  at  large  distances  from  the  origin. 

As  an  example,  consider  (2,4),  Then  we  find 

Th^  = H w0  <|,dxdy  “ I h2^u0  (2.7) 

This  shows  that  a midpoint  rule  numerical  integration  is  being  used  to 
approximate  the  integral,  and  under  smoothness  conditions  it  follows  that  as 
h + 0 

Th($)  = 0(h2). 

Clearly,  higher-order  integration  formulas  can  be  compared  with  each  other  on 
this  basis.  For  a 2 x 2 product  Gauss  rule  in  each  element,  for  example,  we 

4 

have  = 0(h  ) , 
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Next,  recall  the  Important  fact  that  in  the  nonlinear  case  it  is 
necessary  to  compute  the  velocity  field  at  each  timestep  by  solving  (2.2). 
Assume  that  this  is  to  be  done  using  the  Green's  function.  Let  W denote  the 
number  of  arithmetical  operations  required  to  compute  the  velocity  field  at 

O 

each  particle  position.  If  there  are  N particles,  then  W ^ CN  /2,  for 
some  constant  C.  Below,  we  will  use  W as  a standard  unit  of  work  to 
compare  various  new  algorithms.  For  the  Gauss  case  therefore  we  have  a work 
count  of  16W.  From  this  we  see  that  use  of  a higher-order  rule  does  not 
necessarily  assure  a greater  computational  efficiency  for  typical  values  of 
h.  In  the  next  section,  methods  for  obtaining  high-order  accuracy  without 
such  a large  increase  in  the  cost  of  the  computation  are  defined. 


3.  HIGHER  ORDER  METHODS 

The  preceding  remarks  suggest  that  increasing  the  order  of  accuracy  by 
adding  more  integration  nodes  may  not  be  a good  idea.  It  is  natural  to  try  to 
do  the  same  thing  by  increasing  the  amount  of  information  associated  with  each 
node.  Specifically,  in  this  section  we  shall  associate  with  (x^y^,  mth 
order  distributions  of  the  form 


M^x.y)  = l w1q  Da  6(x-xi,  y-  y^.  (3.1) 
|a|<m 

In  (3.1),  which  generalizes  the  simple  6 functions  in  (2.4),  a denotes  a 
multi-index,  and  (x^.y*)  e 1?.  Choice  of  the  weights  w±  and  the  nodes 
(xi»Yi)  can  be  made  in  many  ways.  We  shall  give  three  methods  in  this 
section. 
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Method  1 (Direct  Integration) : 


In  this  method,  (xi,yi)  are  the  corners  of  the  elements,  each  of  which 
has  associated  with  it  an  expansion  of  the  form  (3.1),  The  weights  in  the 
expansion  are  chosen  so  that  when  w0h  is  substituted  into  (2.6),  the  second 

term  gives  a rule  for  integration  of  the  function  (u)Q  <}>),  involving  its 

values  along  with  those  of  its  derivatives  through  order  m at  the  nodes.  We 
shall  consider  the  cases  m = 0 and  m = 1 in  more  detail. 

Let  m = 0.  A rule  for  a square  of  side  h with  corners  at  P,  Q,  R,  S 

which  is  exact  for  bilinear  functions  is 

//  f dxdy  « (h2/4)  (f(P)  + f(Q)  + f(R)  + f(S)).  (3.2) 

2 

Using  this  as  a composite  rule  implies  the  choice  w^qq  = " “^i’^i^  so 
that  we  define 

M1(x,y)  = h2w(xi,yi)  6(x-xi,  y-y± ) . (3.3) 

Since  this  gives  a rule  which  is  locally  exact  for  linear  functions  but  not 

for  all  quadratics  its  accuracy  is  0(h2)  in  the  sense  of  (2.6)  while  the 

work  is  1W.  This  is  essentially  no  different  from  the  mid-point  rule.  In 

fact  this  rule  is  clearly  analogous  to  the  trapezoidal  rule. 

For  a quadrilateral  mesh,  a bilinear  mapping  can  be  used  to  map  the 
quadrilaterals  onto  a standard  square  in  which  (3.1)  can  be  used.  In  some 

circumstances  it  may  be  desirable  to  use  a triangular  mesh  instead  of  the 
quadrilateral  one.  An  0(h2)  rule  for  triangles  analogous  to  (3.1)  can  then 
be  used,  avoiding  the  need  to  map  the  domains. 
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Now  let  m = 1.  Analogous  to  (3.2)  we  have  the  formula 


If  f dxdy  a A(f(P)  + f (Q)  + f(R)  + f(S)) 

+ B(-fx(P)  + fx(Q)  + fx(R)  - fx(S))  (3.4) 

+ C(-fy(P)  - fy(Q)  + f y(R)  + f y(S)) 

o n 

where  A = h /4,  B=C  = hJ/24,  and  P,  Q,  R,  S denote  the  corners  of  the 

square  -h/2  <_  x,  y h/2  labelled  counterclockwise  starting  from  the  top 

right.  Analogous  to  (3.3)  there  is  the  expression 

Mi(x,y)  = l wia  Da  6(x-x1,  y-y^.  (3.5) 

I«|<1 

In  (3*5),  the  coefficients  are  computed  from  the  composite  rule  based  on 

(3.4) .  For  the  uniform  square  mesh  we  are  using  for  illustration,  the  weights 
are 

Wi00  = A'“o(xl’yi)  + B'u0x(xi’yi)  + C'%(xi’yi) 

WilO  = -B'“o(xi’yi) 

wi01  = -C^0(xi,y1). 

(3.4)  is  exact  for  cubic  polynomials.  It  follows  that  this  method  is  accurate 
in  the  sense  of  (2.6)  to  O(h^).  To  compute  work  units  for  this  scheme,  we 
observe  that  although  there  are  only  - N particles  there  is  some  extra  work 
associated  with  computation  of  derivatives  of  the  velocity  kernel.  It  turns 
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out  that  for  this  scheme  the  work  units  are  < 2 — W,  a satisfactory  figure. 
There  is  also  some  additional  work  required  for  computing  the  coefficients  of 
the  derivatives  in  (3.1)  • This  amounts  to  having  to  integrate  two  more 
systems  each  of  two  odes,  in  addition  to  the  characteristic  odes  (see 
appendix) . 

As  in  the  previous  case,  rather  than  use  a quadrilateral  mesh  it  might 
sometimes  be  better  to  use  a triangular  one. 

For  a square  mesh,  the  m = 1 scheme  just  discussed  has  an  interesting 
property  in  the  uniform  case.  This  is  the  following:  due  to  cancellations, 
the  composite  rule  has  weights  of  zero  attached  to  the  derivative  unknowns  at 
interior  vertices.  Hence  the  higher  accuracy  is  achieved  by  corrections  at 
the  boundary.  But  this  implies  the  use  of  a Euler-Maclaurin  type  expansion. 
Thus,  if  o3(f>  has  s continuous  derivatives  in  and  compact  support,  by 
using  nodal  derivatives  up  to  this  order  we  can  get  accuracy  0(hs+*)  merely 
by  using  the  m = 0 scheme,  since  this  is  what  the  composite  scheme  reduces 
to  on  a uniform  mesh  in  that  case.  This  is  another  way  to  look  at  the  results 
of  [1]  - [3], 


Method  2 (Finite  Element  Approach) : 

The  approach  here  uses  a nodal  finite  element  basis  in  the  following 
way:  let  {ij^}  lal  £.  m>  * = 1>2,***,  be  the  standard  nodal  basis  functions 

t"  Vi 

associated  with  the  i node  (x^yp  of  a triangulation  of  the  plane  with 

maximum  edge  length  h.  These  functions  satisfy  conditions  of  the  form 


DP  \|>.  (x.  ,y .) 
i<*  J J 


> 
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where 


is  a Kronecker  delta.  Then  we  define  w 


as 


ia 

w±a  = (-l)^JJ  ^ia(x,y)  a)Q(x,y)dxdy  (3.6) 

where  the  integration  is  over  the  whole  plane.  We  now  have 

//  a,0h^x,y')  p(x*y)dxdy  = //  I I wia  Da6(x-xit  y-y±) 

1 |a!<m 

x <J)(x,y)dxdy,  V <|>  € Cm  (IF?) 

(3.7) 

= I l (-1)^  wia  Da  <|)(xi,yi) 

1 lal£m 

= //  a)Q(x,y)  ())h(x,y)dxdy 

where  <j)  is  the  finite  element  interpolant  of  <}>  on  the  given 

triangulation.  Equation  (2.6)  then  becomes 

Th(<t>)  = If  o>0((J>  - <j>h)dxdy.  (3.8) 

Since  the  error  is  formally  0(hr+*)  where  r is  the  degree  of 

the  highest  order  full  polynomial  space  used,  we  can  say  here  that  is  of 

this  order. 

This  type  of  scheme  differs  from  direct  integration  schemes  in  that  no 
approximation  of  o)q  is  made.  The  test  function  only  (often  a convolution 
kernel  in  practice)  is  approximated  and  the  result  is  integrated  exactly. 
Because  of  this  property,  the  rigorous  error  estimates  for  these  methods 
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require  minimal  regularity  on  u)Q  unlike  the  direct  integration  methods  where 
to  achieve  high  accuracy  requires  Wq  to  have  several  smooth  derivatives 
throughout  1R?.  The  0(hr+1)  estimate  is  in  fact  valid  even  if  we  know 
only  Wq  € L^( . If  ioq  has  extra  regularity  it  can  be  exploited  to  get 
higher  accuracy  by  going  to  negative  norm  estimates  of  the  finite  element 
error.  Smoothness  of  <j>,  however,  is  certainly  required. 

Two  examples  analogous  to  those  considered  above  are  the  case  of 
continuous  linear  elements  on  triangles,  for  which  we  can  expect  O(h^) 
accuracy  with  1W  work  units,  and  full  cubics  - defined  in  terms  of 
derivative  unknowns  at  vertices,  and  function  values  at  vertices  and  centroid 
for  which  the  work  will  be  somewhat  larger  than  the  values  used  so  far  (about 
10  W units). 

In  general,  the  full  range  of  finite  element  spaces  is  available  for  use. 

Method  3 (Taylor/Moment  Expansions): 

Here  we  begin  by  subdividing  the  plane  into  arbitrary  elements  with  mid- 
side nodes  and  arbitrary  element  shapes  allowed  in  principle.  Next,  we  define 

“l!  a2!  Wia  = / / Cx-x± ) 1 (y-  y±)  2 u>0(x,y)dxdy  (3.9) 

in  which  (x^,y^)  is  an  arbitrary  point  within  the  i^  element,  and  the 
integration  is  over  the  ith  element.  The  w^  are  proportional  to  the 
moments  of  restricted  to  the  ith  element,  about  (x^y^.  It  follows  as 

above,  that 

ff  “oh^y)  <Kx,y)dxdy  = //  u)Q(x,y)  <|> ^ (x,y)dxdy  (3.10) 
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where  <f>^(x,y)  is  the  piecewise  polynomial  function,  in  general 
discontinuous,  equal  in  the  iL  element  to  the  Taylor  expansion  of  <{>(x,y) 
through  mth  order  terms,  about  the  point  (xi>yi)*  In  this  sense  the  local 
moment  expansion  defined  by  (3.1)  and  (3.9)  "dualizes"  into  the  local  Taylor 
expansion  about  (x^,y^). 

To  get  the  accuracy  of  this  scheme,  we  substitute  into  (2.6)  to  find  that 

Th^)  = H “ <}>tml)dxdy 

so  that  denoting  by  h the  largest  linear  dimension  of  the  elements,  we 

obtain  accuracy  0(hm+*). 

The  moments  method  also  needs  only  minimal  regularity  on  o)q  for  full 
accuracy  to  be  obtained.  In  practice,  if  m = 1 the  point  (x£,y^)  should 
be  chosen  to  be  the  center  mass  of  ojq  because  then  w^  = 0 for  |ct|  = 1, 
so  we  get  second-order  accuracy  for  the  same  work  as  with  the  lowest-order 
scheme.  Using  quadrilaterals  for  elements,  with  N vertices  there  are 

O 

approximately  N elements  and  so  N particles.  For  0(h)  accuracy  the 
interaction  work  count  is  5W,  and  for  O(h^)  is  8W. 


4.  FURTHER  REMARKS 

There  should  be  no  difficulty  in  extending  the  ideas  of  Section  3 to 
three-dimensional  particle  methods  of  the  kind  suggested  in  [1]  - [3]  and 
[12]. 

Rigorous  analysis  using  the  Sobolev  space  setting  has  been  carried  out 
for  both  the  finite  element  and  moment  expansion  methods. 
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So  far  an  insufficient  amount  of  computation  has  been  done  to  verify  the 


error  estimates  and  decide  about  the  efficiency  of  the  various  methods. 
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APPENDIX 


A framework  for  finding  distributional  solutions  of  (2.1)  with  initial 
condition  = 6(x-Xq,  y-yg)  lal  S.  m can  be  obtained  starting  from  the 

following  considerations.  Let  X(xQ,yQ;t)  and  Y(xQ,yg;t)  denote  the 
characteristic  curves  of  the  equation  (2.1);  here,  t parameterizes  the  curve 
and  the  generic  point  (xg,yg)  denotes  its  origin  at  time  t * 0.  X and  Y 
are  computed  by  solving  the  ordinary  differential  equations 

dX/dt  = u(X,Y, t)  dY/dt  - v(X,Y,t) 

X(0)  = xQ  Y(0)  = yQ. 

At  time  t,  let  J(xq,  yg;  t)  denote  the  Jacobian  of  the  flow  map 
$ : (xg,yg)  + (X,Y).  The  (nonlinear)  case  of  most  interest  from  the  fluids 
viewpoint  has  ux  + vy  “ 0,  which  case  J(xg,yg;t)  = 1.  We  can  obtain  a 
formal  analytical  solution  to  (2.1)  and  (2.3)  by  writing  the  equation  in  terms 
of  the  material  derivative  as  dw/dt  = 0,  integrating  this  equation  over  an 
arbitrary  domain  moving  with  the  velocity  field  (u,v),  say  £2(t),  and  then 
using  the  transport  theorem  to  write 

d/dt  //  co(X,Y,t)dXdY  = 0, 

£l(t) 

from  which  it  follows  immediately  that 

//  w(X,Y,t)dXdY  = //  wn(x,y)dxdy. 

n(t)  n(0) 
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Changing  the  variables  on  the  right~hand  side  to  X and  Y respectively  and 
recalling  the  arbitrariness  of  Cl(t)  now  gives 

o)(X,Y;t)  = a»0(x(X,Y,t),  y(X,Y,t))j-1  (X,Y;t)  (A.l) 

where  (x(X,Y,t),  y(X,Y,t))  is  by  inverting  the  equations  X = X(x,y:t),  Y = 
Y(x,y;t).  The  existence  of  a unique  solution  to  these  equations  follows  from 
ode  theory  provided  u and  v are  smooth.  Reversing  the  steps,  it  follows 
that  (A.l)  satisfies  (2.1)  given  the  required  regularity  of  u,  v,  and  Uq. 

Let  <f>  € C (1?);  multiplying  (A.l)  by  <|>,  integrating  and  changing  the 
variables  on  the  right  to  x and  y we  have 

//  w(X,Y,t)  <fi(X,Y)dXdY  = //  a)Q(x,y)  <(>(X(x,y; t) , Y(x,y,t))dxdy,  (A. 2) 

or  alternatively 

<w,  $>  = <a>0,  <|>o(X,Y)>  (A. 3) 

where  o denotes  composition.  If  X( • , • , t)  and  Y( • , • , t), 

Y(.,.f  t)  eHm+1,”(i?)  (or  € ¥ 0 < t < T,  then  the  right-side  of 

(A. 3)  makes  sense  even  if  w0  + a)0h  = Da  6(x-xQ,  y-yQ)[a|  < m.  Thus  a 
distribution  oj  is  defined  on  <P(m)(l£)  by  (A. 3).  Therefore,  we  can  pose 
the  problem  of  finding  satisfying 

<V  <i>>  - <m0h,  ()> o(X,Y)>  V 4>  C <C(m)  (l?j.  (A. A) 
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A solution  01^  to  (A. 4)  is  given  by 

a).  (X,Y)  = Da  6(X  - X(x,y;t),  Y - Y(x,y;t))  , (A.5) 

n x-xQ,  y-yQ 

the  purely  formal  differentiations  being  performed  w.r.t.  x and  y.  Proof 
that  (A.5)  satisfies  (A. 4)  is  by  direct  computation. 

If  |a|  = 0 we  recover  the  solution  given  in  Section  2.  Consider  the 
case  with  |a|  = 1.  Equation  (A.5)  gives 


“hio  ■ Mx-v  Y-V  + Mx-xo-  Y-Yo)  Yx(X0'VC) 

“hOl  = Sx(X_X0’  Y-Y0>  *y(  W')  + Sy(X-X0’  Y'Yo) 


(A. 6) 


using  the  abbreviation  Xq  for  X(xQ,yQ;t)  and  similarly  Yq.  If  the 
initial  condition  is 


40 hO  = a10  6X(X“V  y~y())  + a01  6y(x“V  y_y())  * 

then  the  solution  to  (A. 4)  of  the  required  form  as  given  by  (A. 6)  is 


“h  ■ *io(t)  sx(x-x0'  Y-V  + aoi(t)  5y(x_xo’  y-  V 

where 

aio(t)  = aio  + aoi  y 

(A. 7) 

a0i(t)  = a10  Yx(x0,y0,t)  + aQ1  Yy(x0>y0>t). 
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Letting  M denote  the  matrix 


differentiation  of  the  characteristic  equations  shows  that 


dM/dt  = V(u ,v)M 


and  the  initial  condition  for  this  system  is  M(0)  = 1,  the  identity  matrix. 
It  will  be  necessary  to  solve  this  and  analogous  systems  for  the  higher-order 
cases  in  order  to  compute  the  numerical  approximations.  Having  solved  it, 
a^Q(t)  and  aQj(t)  are  given  by  (A. 7). 
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ABSTRACT 

A pseudo-time  method  is  introduced  to  integrate  the  compressible  Navier- 
Stokes  equations  to  a steady  state.  This  method  is  a generalization  of  a 
method  used  by  Crocco  and  also  by  Allen  and  Cheng.  We  show  that  for  a simple 
heat  equation  that  this  is  just  a renormalization  of  the  time.  For  a 
convection-diffusion  equation  the  renormalization  is  dependent  only  on  the 
viscous  terras.  We  implement  the  method  for  the  Navier-Stokes  equations  using 
a Runge-Kutta  type  algorithm.  This  enables  the  time  step  to  be  chosen  based 
on  the  inviscid  model  only.  We  also  discuss  the  use  of  residual  smoothing 
when  viscous  terms  are  present. 
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I.  INTRODUCTION 


The  solution  of  the  compressible  Navier-Stokes  equations  for  flow  about 

two-  and  three-dimensional  complex  aerodynamic  configurations  is  still  a time 

consuming  problem  on  today's  supercomputers.  The  resolution  of  the  boundary 

layers  requires  the  use  of  very  fine  meshes  in  the  neighborhood  of  solid 

bodies.  For  a typical  viscous  flow  the  mesh  can  be  several  orders  of 

magnitude  finer  (depending  on  the  Reynolds  number)  than  that  required  for  an 

inviscid  calculation.  As  an  example,  using  a C-type  mesh  about  an  NACA  0012 

airfoil,  a typical  mesh  spacing  near  the  body  in  the  normal  direction  for  an 

-2 

inviscid  calculation  is  1 x 10  chords.  For  a laminar  viscous  calculation 

3 -4 

with  Re  = 5 x 10  , this  minimum  cell  height  would  be  about  6 x 10 

chords.  For  a turbulent  calculation  using  an  algebraic  turbulence  model  and 

with  Re  - 3 x 10^,  the  minimum  cell  height  would  be  about  8 x 10 

chords.  In  all  cases  a typical  chordwise  spacing  at  the  midsection  of  the 

-2 

airfoil  is  about  5 x 10  chords. 

Using  an  explicit  method  this  fine  mesh  reduces  the  time  step,  due  to 
stability  requirements,  that  can  be  used.  The  time  step  restriction  is  caused 
by  two  factors.  One  contribution  is  due  to  the  effect  of  the  finer  mesh  on 
the  inviscid  portion  of  the  calculation.  When  using  an  explicit  method  this 
reduction  of  the  time  step  cannot  be  avoided  without  using  a coarser  mesh.  It 
follows  strictly  from  the  need  to  include  the  entire  domain  of  dependency  in 
the  numerical  algorithm.  Use  of  a local  time  step  allows  faster  convergence 
to  a steady  state,  but  it  does  not  remove  the  requirement  to  satisfy  the 
convection  stability  condition  in  a local  sense.  A second  difficulty  is 
caused  by  the  viscous  terms.  For  an  explicit  method  the  time  step  is  now 
dependent  on  the  square  of  the  mesh  size  rather  than  just  the  mesh  size  as 
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occurs  for  inviscid  flow.  Thus,  even  for  a high  Reynolds  number  flow  the 
viscous  time  step  will  dominate  when  the  mesh  is  sufficiently  fine.  In  all 
these  cases  the  use  of  an  implicit  scheme  will  alleviate  the  difficulties.  In 
some  ADI  methods  the  Jacobian  of  the  viscous  terras  is  not  used  in  the  implicit 
portion  of  the  code  in  order  to  improve  the  speed  of  the  calculation  [7],  We 
thus  conclude  that  for  both  explicit  and  many  implicit  codes  it  is 
advantageous  to  account  for  the  dependence  of  the  time  step  on  the  viscous 
terms. 

In  this  study  we  shall  only  discuss  steady  state  problems  which  are  solved 
by  a pseudo  time-dependent  method.  Hence,  we  can  change  all  time  derivatives 
as  long  as  the  steady  state  solution  is  not  affected.  One  common  device  is  to 
use  a different  time  step  in  each  zone.  It  is  easier  to  calculate  this  local 
time  step  based  on  the  inviscid  equations.  This  provides  an  additional  reason 
to  eliminate  the  dependence  of  the  time  step  on  the  viscous  terms. 

In  this  study  we  shall  analyze  a method  used  by  Crocco  [4]  and  also  by 
Allen  and  Cheng  [2],  They  claim  that  the  new  scheme  is  unconditionally  stable 
for  a simple  diffusion  equation.  We  will  show  that  in  effect  the  scheme  is  a 
standard  Euler  forward-in-time  central-in-space  scheme.  The  time  is 
artificially  slowed  down  so  as  to  satisfy  the  stability  criterion.  We  then 
extend  this  scheme  to  the  compressible  Navier-Stokes  equations  using  a Runge- 
Kutta  scheme  [9],  This  modification  enables  us  to  choose  our  time  step  based 
on  the  inviscid  equations.  The  modification  automatically  reduces  the  local 
time  step  In  regions  where  the  viscous  time  step  is  of  importance.  This 
enables  us  to  use  the  inviscid  time  step  in  the  far  field  while  automatically 
accounting  for  viscous  effects  in  the  boundary  layer.  We  will  also  look  at 
residual  smoothing  for  the  heat  equation. 
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II • SCALAR  EQUATION 

In  this  section  we  analyze  and  extend  a scheme  for  the  Navier-Stokes 
equations  proposed  by  Crocco  [4]  and  Allen-Cheng  [2]*  This  scheme  was  also 
analyzed  by  Peyret  and  Viviand  [6]  and  Roache  [8],  and  we  will  extend  their 
analysis. 

We  first  consider  the  heat  equation 
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The  forward  time  centered  space  or  Euler  approximation  to  this  scheme  is  given 
by 
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Crocco,  and  Allen/Cheng  introduce  the  inconsistent  scheme 
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This  scheme  is  unconditionally  stable.  If  we  are  only  interested  in  the 
steady  state,  then  (4)  yields  the  correct  steady-state  solution.  We  now 

rewrite  (4)  as 
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or 


(5) 


with 
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Thus,  for  this  model  problem  the  Crocco  scheme  is  identical  with  the  Euler 
scheme  (2)  with  an  artificial  time  step  At^  given  by 


2e 

2 * 

(Ax) 


(7) 


Thus,  the  unconditional  stability  is  achieved  by  slowing  down  the  time 
process.  Note  that  as  At  + «*,  At  + (Ax)^/2e,  i.e.,  the  stability  limit  for 
the  Euler  method.  So  choosing  a large  time  step  for  (4)  is  equivalent  to 
choosing  At^  at  the  stability  limit  for  (2),  and  we  have  merely  scaled  the 
time.  This  can  also  be  derived  from  the  modified  equation  given  in  [6].  If 
e or  Ax  is  not  constant,  this  also  introduces  a local  time  step. 

We  next  consider  the  convection-diffusion  equation 
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The  Crocco  scheme  now  becomes 
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with  At  given  by  (6).  Thus,  again  this  is  equivalent  to  the  Euler  scheme 
with  a time  scaling  that  depends  only  on  the  viscous  terms*  Allen  and  Cheng 
utilized  this  scheme  within  a time-marching  scheme  proposed  by  Brailovskaya 
[3],  We  generalize  this  by  considering  a general  N-stage  Runge-Kutta  scheme. 

Consider  the  two-dimensional  equation 
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where  Hw  describes  the  hyperbolic  or  first-order  terms.  In  [9]  we  describe 
a Runge-Kutta  scheme  where  the  viscous  terms  are  frozen  for  all  the  stages. 
This  is  similar  in  philosophy  to  the  Brailovskaya  scheme.  Using  the  Crocco 
formulation  the  (K  + l)-st  stage  becomes 


(K+l)  n 

W3,k  ~Wj,k 

°K+1  At 


H W(K)  + — - — (w11  - 2w^+^  + w11  1 

^ j’k  (Ax)2  1 Vl,k  j>k 


(12) 


+ — - — fw11  - 2w^K  + w11  1 K-0  1 •••  N-l 

(Ay)2  '•"j.k+l  /wj,k  K U’A’  ’N 


This  reduces  to  a Runge-Kutta  scheme 
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where  Hp,  Pp  are  the  approximations  to  the  hyperbolic  and  parabolic  parts 
respectively  and 
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We  slightly  generalize  (14)  by  redefining  Atfi  by 
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(15) 


where  k is  a constant  that  we  can  choose.  The  fora  of  (15)  no  longer 

follows  directly  from  the  Crocco  formulation.  Instead  k will  be  chosen 

based  on  a stability  analysis. 

We  choose  At  in  (12)  or  (15)  based  on  the  hyperbolic  (inviscid) 

stability  condition.  We  then  find  At^  from  (15)  and  advance  to  stage 
(K  + 1)  using  the  Runge-Kutta  scheme  (13). 

The  constant  k in  (15)  can  be  chosen  so  that  we  recover  the  parabolic 
stability  limitation  when  HD  = 0.  The  exact  value  of  k depends  on  the 
coefficients  in  the  Runge-Kutta  formula.  In  order  to  see  this  more 

clearly  we  revert  to  the  one-dimensional  convection-diffusion  equation  (8). 
We  replace  all  space  derivatives  by  second-order  central  differences  while  the 
time  derivative  is  kept  continuous.  We  therefore  have 
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We  Fourier  transform  (16)  to  get 


w^  = Xw 

with 


X(5)  = - — 9 (1  ~ cos  5)  + ~ sin  5 

(Axr 


0 < 5 < 2tt. 


(17) 

(18) 


A Runge-Kutta  scheme  for  (16)  or  (17)  is  stable  whenever  z(£)  = X(5)Atg  lies 
within  the  stability  domain  that  depends  on  for  all  0 5 < 2ir. 
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We  consider  the  stability  domain  for  the  four-step  scheme  with  = 1/4, 

= 1/3,  ctg  = 1/2,  = 1.  This  scheme  has  a stability  condition  along  the 

imaginary  axis  of  max|z|  2/2",  i.e.,  for  a hyperbolic  problem  (e  = 0) 
aAt  _ ? 

<.  2/2.  Along  the  negative  real  axis  the  stability  condition  is 

2eAt 

| z | < 2.8  and  for  a parabolic  problem  (a  = 0)  =-  < 2.8.  Hence  for  this 

(Ax)2  ~ 

case  we  would  choose  k in  (15)  as  k = 1.4.  We  define  the  cell  Reynolds 
number  as 


a Ax 
'h  e * 


(19) 


The  previous  analysis  shows  that  the  Runge-Kutta  scheme  is  stable  for  R^  = 0 
and  R^  = We  do  not  have  any  proof  that  the  scheme  is  stable  for  all  R^. 


III.  NAVIER-STOKES  EQUATIONS 

We  now  discuss  the  implementation  of  these  ideas  to  the  two-dimensional, 
compressible,  Navier-Stokes  equations.  The  extension  to  three  dimensions  is 
straightforward.  We  first  consider  the  conservation  form  in  Cartesian 
coordinates.  We  express  the  equations  in  the  following  form 
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and  Hj  denote  first  derivative  terms  (including  the  artificial  viscosity  and 
also  the  viscous  dissipation  function).  The  coefficients  of  viscosity  (p 
and  X),  y the  specific  heat  ratio,  and  the  Prandtl  number  Pr  are  all 
assumed  (for  the  analysis)  to  be  locally  constant. 

In  deriving  our  results  we  shall  ignore  all  cross  derivatives  (see,  e.g., 
[1],  [2]).  Based  on  our  previous  analysis  we  add  the  following  terms  to  the 
standard  Runge-Kutta  scheme. 
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where  Aw  = w - w11  and  denote  the  usual  space  derivative  terms. 

For  simplicity  we  have  chosen  k = 1,  and  a denotes  the  constant  in  the 
Runge-Kutta  scheme  (28).  Thus  the  density  equation  is  unchanged.  The  second 
and  third  equations  can  be  solved  directly  for  A(pu),  A(pv).  Once  A(pu), 
A(pv)  are  known  the  last  equation  can  be  solved  for  A(pE).  As  before  these 
corrections  imply  an  effective  time  step  which  automatically  accounts  for  the 
viscous  time  step.  In  this  case  the  effective  time  step  differs  for  each 
equation. 

We  finally  consider  the  Navier-Stokes  equation  in  body  fitted 

coordinates.  This  can  be  done  either  in  a finite  volume  scheme  or  by  using 
transformations.  The  result  is  the  same  in  either  case  [9],  and  so  we  shall 
use  a transformation  for  ease  of  presentation.  Let  g = g(x,y),  n = n(x,y) 
be  the  body  fitted  coordinates.  We  choose  the  coordinate  scaling  so  that 
Ag  = An  = 1.  The  Navier-Stokes  equations  (20)  now  become 
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(22) 


where  Hj  are  first  derivative  terms  and  we  have  ignored  all  second  cross 
derivative  terms.  As  before  this  generates  an  appropriate  correction  term  to 
the  Runge-Kutta  scheme.  Equation  (21)  is  now  replaced  by 
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where  represents  the  standard  finite  difference  terms. 

As  before  the  density  equation  is  unchanged  by  the  viscous  correction. 
Now,  however,  the  two  momentum  equations  are  coupled  together,  unless  the 
coordinate  system  is  orthogonal.  As  we  have  two  equations  for  A(pu)  and 
A(pv),  and  we  can  easily  solve  these.  To  simplify  the  notation  we  define 

Zl  = l+^l[(X+2u)U2  + q2)  + y(52  + h2)] 

Z2  = ^ (X  + P)(5x  5y  + \ V (24) 

z4  = 1 + UU*  + \)  + U + 2u)U2  + n2)] 

and 

D - (1  + 2^(1  + z4)  - z2. 

Then 
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(25) 


Ap  = K, 


A(pu)  = 


K2  z4  “ K3  z2 


A(pv)  = 


K3  Z1  “ K2  z2 


As  before  given  A(pu)  and  A(pv)  we  can  solve  for  A(pE)  directly  from  the 
energy  equation  in  (23).  We  also  note  that  if  one  uses  the  thin  layer 
approximation  (dropping  all  second  £ derivatives  and  cross  derivatives  in 
(22))  then  these  terms  simplify  slightly.  In  this  case  Ap,  Apu,  Apv  are 
still  given  by  (25)  with 
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IV.  RESIDUAL  SMOOTHING 


As  an  alternative  method  of  reducing  the  effect  of  the  parabolic  terms  on 
the  stability  of  the  scheme  we  consider  residual  smoothing.  With  this 
technique  one  post-processes  an  explicit  method  with  an  implicit  method.  In 
practice  one  post-processes  each  equation  separately  and  each  direction 
separately  so  that  only  scalar  tridiagonal  matrices  need  be  inverted.  When 
using  a multistage  Runge-Kutta  method,  one  can  apply  the  residual  smoothing 
after  each  stage,  or  at  the  end  of  the  entire  process,  or  any  intermediate 
permutation. 

In  [10]  it  is  shown  that  one  can  construct  such  a scheme  for  a hyperbolic 
equation  so  that  the  total  method  is  unconditionally  stable.  It  is  further 
shown  in  [10]  that  it  is  not  efficient  to  use  a very  large  At  even  ignoring 
splitting  errors.  An  optimal  At  is  about  two  to  three  times  larger  than  the 
explicit  time  step.  We  now  consider  the  process  for  a parabolic  problem  in 
order  to  see  the  effect  of  viscous  terms. 

We,  therefore,  consider  the  heat  equation 


u^  = bu 
t xx 


We  solve  this  equation  by  a k-stage  Runge-Kutta  scheme 


U(1)  = un  + Oj  AtQu 


(£+1)  n . Aj_  (£) 

u = u + ot£+^  AtQu 


n+1  (k) 

u = u 


(27) 


(28) 
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where  are  given  coefficients  with  = 1.  Q is  a difference 

approximation  to  uxx.  The  amplification  factor  corresponding  to  (28)  is 

G - 1 + AtQ  + B2(At)2  Q2  + ...  + 8k(At)k  Qk  (29) 

A 

where  3^=1  and  3^  = ak-£.+l’  * ~ 2,***>k.  Q *s  t'ie  Fourier 

transform  of  Q.  Hence,  for  second-order  central  differencing 

- . . 4b  sln2(8/2)  _ (30) 

(Ax) 

Residual  smoothing  consists  of  updating  a stage  (A)  by 

(1  - aD2)Au(il)  = u(A)  - un  (31) 

where  is  again  a second-order  central  difference  approximation  to  uxx, 

i.e.,  + (1,-2,1).  We  now  consider  two  possibilities.  In  the  first  we 

apply  (31)  only  after  the  final  stage.  Then  the  new  amplification  factor  is 

B.  AtQ  + B-,(At)2  Q2  + ...  + B.(At)k  Qk 

G (9)  = 1 + — - . (32) 

1 + 4o  sin  (0/2) 

The  second  case  we  consider  is  applying  (31)  after  every  stage.  The  resultant 
amplification  factor  is 


with 


G2(0)  = 1 + Bt  AtR  + 32(At)2  R2  + + 0k(At)k  Rk 


R = 


1 + 4a  sin  (0/2) 


(33) 

(34) 
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We  now  investigate  the  possibility  that  either  of  these  schemes  is 
unconditionally  stable.  To  investigate  this  we  need  only  consider  At 
sufficiently  large.  We  thus  consider  At  -*■  " with  a + ~.  Then  (32)  becomes 

<-Dk  8k[^  =i"2(|)]k 

G (6)  1 + . (35) 

1 + 4cx  sin  (0/2) 

We  thus  see  that  for  k even,  G^e)  > 1 and  so  (28)  - (31)  cannot  be  stable 
for  At  large.  For  k = 1 the  scheme  is  identical  with  backward  Euler  for  a 
scalar  one-dimensional  equation  and,  hence,  unconditionally  stable.  For  the 
second  case  we  see  that  (33)  has  the  same  form  as  a standard  Runge-Kutta 

A A 

method  with  Q replaced  by  R,  (34).  Hence,  it  follows  that  the  scheme  is 

A 

stable  whenever  AtR  is  within  the  stability  region  of  the  scheme.  As 
At  00 , so  does  a and  so  there  is  a cancellation  between  the  numerator  and 

A 

denominator;  thus,  AtR  remains  bounded  as  At  increases.  We  thus  conclude 
that  applying  the  residual  smoothing  after  each  stage  can  make  the  scheme 
unconditionally  stable  even  for  a Runge— Kutta  method  with  an  even  number  of 
stages. 

We  also  see  from  the  above  argument  that  as  At  increases  so  must  cr.  In 
[9],  [10]  we  show  that  for  a hyperbolic  equation 

ut  + aux  = 0 

9 

that  a is  proportional  to  (aAt/Ax)  • For  the  parabolic  problem  (27)  it 
follows  from  (35)  that  a should  be  proportional  to  bAt/(Ax)^.  For  the 
combined  convection-diffusion  equation  a will  be  related  to  the  sum  of  two 
such  contributions. 


346 


It  follows  from  (33),  (34)  that  if  we  apply  residual  smoothing  after  every 
stage  then  the  stability  polynomial  has  the  same  form  as  the  original 

A A 

polynomial  (29).  The  only  difference  is  that  Q is  now  replaced  by  R.  From 

A A A 

(34)  it  follows  that  the  ratio  of  Q to  R is  real.  Hence,  if  Q is  any 

A 

complex  number  then  R lies  along  the  same  ray  in  the  complex  plane  but  with 
a different  amplitude.  We  therefore  have  shown  that  if  the  original  scheme 
was  unstable  for  a given  direction  then  residual  smoothing  cannot  stabilize 
the  scheme.  Furthermore,  if  the  original  scheme  was  conditionally  stable  then 
by  choosing  a = a(At)  sufficiently  large  we  can  make  the  scheme 
unconditionally  stable.  We  have  thus  shown 

A 

Theorem:  Let  Q be  the  amplification  factor  for  any  approximation  to  the 

convection-diffusion  equation  and  let  (29)  be  the  stability  polynomial  for  a 


k stage 

Runge-Kutta 

scheme. 

We  now  apply  residual  smoothing,  (31),  after 

every  stage  of  the 

scheme. 

If 

the  original  scheme  was  unconditionally 

unstable 

then  the  new  scheme 

is 

still  unconditionally  unstable.  If  the 

original 

scheme  was 

conditionally 

stable  then  the  scheme  with  residual 

smoothing  can  be  made  unconditionally  stable  by  choosing  o(At)  sufficiently 
large. 


Hence,  if  the  smoothing  is  applied  at  the  end  when  solving  a parabolic 
equation,  then  the  scheme  can  be  unconditionally  stable  only  when  using  a 
multistage  scheme  with  an  odd  number  of  stages.  When  the  smoothing  is  done 
after  each  stage,  the  scheme  can  be  stabilized  for  a large.  For  a system 
with  a hyperbolic  portion  and  a small  parabolic  contribution,  e.g.,  high 
Reynolds  number  Navier-Stokes , the  residual  smoothing  is  most  effective  with  a 
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time  step  about  twice  that  of  the  explicit  convective  portion.  Hence,  the 
question  of  unconditional  stability  is  somewhat  academic.  In  practice  [8]  the 
Runge-Kutta  scheme  for  the  Navier-Stokes  equations  is  used  with  four  stages 
and  with  the  residual  smoothing  applied  after  each  stage. 


V.  RESULTS 

^ fchis  section  we  present  some  results  for  viscous  flow  obtained  using 
the  analysis  of  Sections  II  and  III.  We  used  a Runge-Kutta  code  to  solve  the 
Navier-Stokes  equations  for  two  flows  about  an  airfoil  section.  The  details 
of  this  code  are  discussed  in  [5],  [9],  [10],  [11].  In  these  cases  we 

considered  only  the  thin-layer  form  of  the  Navier-Stokes  equations. 

For  the  first  case  we  computed  laminar  flow  over  an  NACA  0012  airfoil  with 

a free-stream  Mach  number  M of  0.5  and  a Reynolds  number  Re  of 

00  00 

3 

5 x 10  . The  angle  of  attack  (a)  of  the  airfoil  was  zero  degrees.  Half- 
plane calculations  were  performed  using  a C-type  grid  consisting  of  64  cells 

in  the  streamwise  direction  and  64  cells  in  the  normal-like  direction.  The 

grid  spacing  at  the  airfoil  surface  was  about  6 x 10-^  chords.  The  mesh 

spacing  in  the  streamwise  direction  over  the  central  part  of  the  airfoil  was 
AX  = 0.05  chords.  Results  for  this  case  are  shown  in  Figures  la  - lc.  As 
indicated  in  Figure  lb,  the  flow  separates  at  X = 0.817  chords.  The  size  of 
the  recirculation  zone  is  displayed  in  Figure  lc.  The  results  are  all 

independent  of  the  time  step  procedure  used  to  reach  the  steady  state. 

In  Figure  Id  convergence  histories  for  this  case  for  two  calculations  are 
shown.  The  residual  displayed  in  this  graph  is  the  root  mean  square  of  the 
residual  of  the  continuity  equation.  The  calculations  were  started 
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impulsively  by  inserting  the  airfoil  into  a uniform  flow  and  immediately 
enforcing  the  appropriate  boundary  conditions.  Local  time  stepping  and 
enthalpy  damping,  (see  [9])  were  employed  in  each  computation;  no  residual 
smoothing  was  used.  For  history  A the  Runge-Kutta  scheme  with  the  time  step 
(At)  limitation  determined  by  convection  was  used;  this  required  choosing  a 
CFL  = 1.0.  For  curve  B a larger  Courant-Friedrichs-Lewy  (CFL)  number  was 
used  by  accounting  for  the  diffusion  limit  on  At  with  the  pseudo-time 
algorithm.  This  allowed  choosing  CFL  = 2.5  based  on  an  inviscid 

criterion.  There  is  additional  work  with  the  pseudo-time  scheme. 
Nevertheless,  the  computational  time  required  to  reach  a satisfactory  level  of 
convergence  was  reduced  by  a factor  of  1.7. 

In  the  second  case  we  solved  for  turbulent  flow  over  an  NACA  0012  airfoil 
with  = 0.5,  Re^  = 2.89  x 10^,  and  a = 0 degrees.  A 60  x 50  half-plane 
grid  was  used  in  the  computations.  The  grid  spacing  at  the  surface  was  about 
8.5  x 10  ^ chords.  The  chordwise  spacing  at  the  midsection  of  the  airfoil 
was  about  AX  = 0.036  chords.  Numerical  results  for  this  case  are  presented 
in  Figures  2a  and  2b. 

Figure  2c  shows  two  convergence  histories  for  this  turbulent  flow  case. 
As  in  the  laminar  flow  problem,  the  histories  were  obtained  by  computing 
without  and  with  the  effects  on  At  due  to  diffusion.  The  pseudo-time 
algorithm  was  about  1.4  times  faster  in  reaching  steady  state.  This  is  close 
to  the  factor  expected,  since  we  were  able  to  increase  the  CFL  from  1.5  to 
2.7,  a factor  of  1.8.  We  do  not  achieve  this  speedup  of  1.8  since  there  is 
some  reduction  of  the  effective  time  step  due  to  the  diffusion  terms. 
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VI.  CONCLUSIONS 


The  use  of  the  Crocco  scheme  for  a scalar  convection-diffusion  equation 
introduces  a scaling  of  the  time  step.  This  reduces  the  effective  time  step 
so  that  the  viscous  stability  limit  is  automatically  satisfied.  As  such  the 
scheme  cannot  introduce  any  fundamental  acceleration  in  reaching  the  steady 
state.  The  advantage  of  the  scheme  is  that  we  do  not  need  to  explicitly 
account  for  the  viscous  time  step  restriction;  it  is  done  automatically.  This 
can  be  done  efficiently  using  Runge-Kutta  type  schemes.  In  addition,  for 
variable  coefficients  or  nonuniform  meshes  this  introduces  an  effective  local 
time  step. 

Using  this  scheme  for  a system  of  equations,  e.g.,  Navier-Stokes , has  the 
additional  benefit  that  a different  scaling  is  chosen  for  each  equation.  Thus 
each  equation  has  its  own  appropriate  (viscous)  time  step.  This  is  equivalent 
to  using  a diagonal  preconditioning  [10]  to  accelerate  the  equations  to  a 
steady  state.  Computations  demonstrate  that  we  can  gain  a factor  of  between 
1.5  and  2 with  little  programming  effort. 

We  further  show  that  if  one  uses  residual  smoothing  to  increase  the  time 
step  then  one  must  also  account  for  the  viscous  terms.  When  the  smoothing  is 
applied  after  the  completion  of  a Runge-Kutta  cycle  then  unconditional 
stability  is  possible  only  if  an  odd  number  of  stages  is  used.  Applying  the 
smoothing  after  each  stage  allows  for  unconditional  stability  for  all 
multistage  schemes  provided  o is  chosen  sufficiently  large. 
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.30 


Figure  lb.  Skin-friction  (based  on  free-stream  conditions)  distribution  for 
laminar  flow  over  an  NACA  0012  airfoil  (M  =0.5  Re  = 5 x 103 
a = 0 degrees). 
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Figure  1c.  Streamlines  for  upper  surface  at  the  trailing  edge  (M^  - 0.5, 

3 

Re  •=  5 x 10  , a = 0 degrees). 


Iterations 


Figure  Id.  Convergence  histories  for  laminar  airfoil  flow  calculations. 

A — Runge-Kutta  scheme  without  pseudo-time  algorithm  (CFL  number 
of  1.0). 

B — Runge-Kutta  scheme  with  pseudo-time  algorithm  (CFL  number  of 
2.5). 
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Figure  2c.  Convergence  histories  for  turbulent  airfoil  flow  calculations. 

A — Runge-Kutta  scheme  without  pseudo-time  algorithm  (CFL  number 
of  1.5). 

B — Runge-Kutta  scheme  with  pseudo-time  algorithm  (CFL  number  of 
2.7). 
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I n+roduc+ion 


It  is  natural  that  the  rapid  evolution  of  increasingly  powerful  computers 
should  inspire  attempts  to  solve  previously  intractable  problems  by  numerical 
calculation.  One  might  imagine  that  within  a fairly  short  time,  advances  in  pro- 
cessing speed  and  memory  capacity  ought  to  reduce  the  simulation  of  physical 
systems  governed  by  partial  differential  equations  to  a matter  of  routine.  The 
numerical  computation  of  solutions  of  nonlinear  conservation  laws  has  proved,  in 
fact,  to  be  perhaps  unexpectedly  difficult.  Discontinuities  are  likely  to 
appear  in  the  solution,  and  schemes  which  are  accurate  in  smooth  regions  tend  to 
produce  spurious  oscillations  in  the  neighborhood  of  the  discontinuities.  These 
oscillations  can  be  eliminated  by  the  use  of  strongly  dissipative  first  order 
accurate  schemes,  but  these  schemes  severely  degrade  the  accuracy  and  usually 
produce  excessively  smeared  discontinuities. 

The  scalar  nonlinear  conservation  law  in  one  space  dimension 


3 u 3 , . . n 

St  +^f<u>  = 0 


(I) 


provides  a model  which  already  contains  the  phenomena  of  shockwave  formation  and 
expansion  fans.  Thus  it  can  be  used  to  provide  insight  into  the  likely  beha- 
vior of  numerical  approximations  to  more  complex  physical  systems,  while  it  is 
still  simple  enough  to  be  fairly  easily  amenable  to  analysis.  A rather  complete 
mathematical  theory  of  solutions  to  ( 1 ) is  by  now  available  11-3). 


Equation  (1)  describes  wave  propagation  at  a speed 

, x 3 f 

alu>  “ sir  • 
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The  solution  is  constant  along  the  characteristic  lines 

at  = a(u) 

provided  that  they  do  not  intersect  to  form  a shock  wave.  Tracing  the  solution 
backward  along  the  characteristics,  it  can  be  seen  that  the  total  variation 


00 

TV(u)  = / 

— 00 


is  constant  prior  to  the  formation  of  a shockwave,  while  it  may  decrease  when 
the  shockwave  is  formed.  Correspond i ng  to  this  property  it  may  be  observed  that 
no  new  local  extrema  may  be  created  and  that  the  value  of  a local  minimum  is 
non-decreasing  while  the  value  of  a local  maximum  is  non-increasing.  It  follows 
that  an  initially  monotone  profile  continues  to  be  monotone. 


It  seems  desirable  that  these  properties  should  be  preserved  by  a numerical 
approximation  to  (1),  This  will  guarantee  the  exclusion  of  spurious  oscilla- 
tions in  the  numerical  solution,  Harten  [41  has  recently  introduced  the  concept 
of  total  variation  diminishing  (TVD)  difference  schemes,  which  have  the  property 
that  the  discrete  total  variation 


00 

TV(v)  = Z | v 

k=-oo  k 


of  the  solution  vector  v cannot  increase.  Harten  also  devised  procedures  for 
constructing  both  explicit  and  implicit  TVD  schemes  14,5). 


The  purpose  of  this  paper  is  to  state  and  prove  conditions  for  the  construc- 
tion of  multi-point  TVD  schemes.  Conditions  are  derived  for  explicit,  implicit, 
and  also  semi -d i screte  operators  to  be  TVD.  The  conditions  are  both  necessary 
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and  sufficient  in  the  case  of  the  explicit  and  semi -d i screte  schemes.  The 
reasoning  is  a modification  and  extension  of  the  reasoning  used  by  Lax  in  an 
appendix  to  reference  5.  The  results  were  first  presented  in  a lecture  at  ICASE 
in  March  1984.  The  present  paper  is  an  amplification  and  revision  of  a 
Princeton  University  report  issued  under  the  same  title  in  April  1984  16). 

In  the  intervening  period  Osher  and  Chakravarthy  have  given  another  proof  that 
conditions  (3.12)  are  sufficient  for  an  explicit  scheme  to  be  TVD  17). 
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2.  Conditions  for  Reduction  of  the  Norm 


One  dimensional  difference  operators  act  on  doubly  infinite  sequences 

u * {\}  > ~ “ < k < » . (2.1) 


The  l | norm  of  such  a vector  u is  defined  as 


H,=  1 |u  | • (2.2) 

— 00 

The  space  of  all  vectors  u with  finite  norm  is  denoted  by  ij. 

A difference  operator  maps  Aj  into  and  is  of  the  form 

A,U)k  ■ l aj  Vj  • <2-3) 

j 

The  coefficients  aj  depend  on  k,  either  explicitly  or  through  dependence  on  u. 
In  either  case  we  write 

a . = a . (k) . 

J J 

Theorem  A:  The  operator  A defined  by  (2.3)  satisfies 

|A(u)  \]  < | u | 1 (2.4) 

for  all  u in  if  and  only  if 

I la  j <h+j  > I < 1 (2.5) 

j J 

for  all  h. 


An  operator  A satisfying  (2.4)  is  a contraction. 

Proof*  The  signum  function  is  defined  for  every  real  u by 
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1 ( for  u > 0 

signum  u = 0 < for  u = 0 . (2.6) 

-1  ( for  u < 0 


Now  set 


sk  = signum  A(u)k;  (2.6*) 

then,  by  definition  (2.2)  of  the  norm  and  definition  (2.6)  of  signum  we  have 


|A(u)|,  = l |A(u)  | = l sk  A(u)  = 
k k 

= V10  Vj  ’ aJ"’+J>  Vjuh 

■ l wh  % < 


where 

"h  = iaj(htJ,sh+J  • 

J 

Since  S|<  takes  on  the  values  ±1  or  0,  it  follows  from  (2.7)  that 


(2.7) 


(2.7*) 


l"h I < l laj  <h+J } I * 

J 

It  follows  therefore  from  assumption  (2.5)  that 


for  all  h.  Setting  this  into  (2.7)  we  deduce  that  (2.4)  holds  for  all  u in  ij. 


To  show  the  necessity  of  (2.5)  suppose  on  the  contrary  that  it  fails  for 
some  h = hg.  Set  u^  equal  to 


u(0) 

SL 


1 for  = hQ 
0 for  X.  * ho 


For  this  u<0)  it  follows  from  (2.3)  that 


(2.8) 
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and  so 


. , (0) , 

Mu  >k = ak-h0‘k) 


|A<“<0)>  l,=  L |A<u"">kl  - i |afc_h  <k)| 


(0), 


= L laj(h0+j)  | > 1 
J 

since  hQ  was  so  chosen  that  (2.5)  is  violated.  On  the  other  hand  it 
from  (2.8)  that 

I (0)  | , 

|u  |,=  1 

This  combined  with  (2.9)  shows  that  (2.4)  fails  for  u<0). 


For  use  in  implicit  schemes  the  following  result  is  needed. 

Theorem  B:  Def i ne  the  operator  B by 

B(u)k  = l b.Ck>Vj. 

J 

B satisfies 

|B<u)  |t  > |u|, 

for  all  u in  jf 

b (h)  - i |b  (h+j)  | > 1. 
j+0  J 

An  operator  B satisfying  (2,11)  is  called  an  expans i on « 
Proof : We  define 

s = signum  u 
K K 

Since  |sk  | < 1 , 

|B(u)|t  = X |B(u)k|  > l sk  B(u)k  . 
k k 

Analogously  to  (2.7),  (2.7)*  we  have 


(2.9) 


is  obvious 


(2.10) 


(2.11) 


(2.12) 


(2.13) 


(2.14) 
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(2.15) 


where 


l \ B<u)k  ’ i % uh 

k h 


\‘i  bj<h*j> sh+J 

J 


(2.15)* 


It  follows  readily  from  (2.12)  that  if  u t 0, 


Using  (2.13)  we  get 


These  two  imply  that 


K > 1 * 


signum  wh  = signum  . 


I \ uh  > iJli  • 

h 


(2.16) 


Combining  (2.14),  (2.15),  and  (2.16)  we  get  (2.11). 

We  remark  that  (2.12)  is  far  from  being  necessary  for  B to  be  expansive. 
For  example,  take  the  right  shift  operator  T,  with 


. { ' ,or  J 

J ( 0 for  j 


= 1 
* 1 


Clearly,  T is  an  isometry: 

(Tu),  = |u|,, 

but  condition  (2.12)  is  utterly  violated. 

Theorem  A has  a continuous  analogue: 

Theorem  C:  Let  u(t)  be  a differentiable  function  of  t real  whose  values  lie 


in  %\,  and  which  satifies  a differential  equation  of  the  form 


du  , 

st  ■ 0<u>’ 


where  C is  a difference  operator,  i.e.,  an  operator  of  the  form 


(2.17) 


Clu)k  - \ cj  Vj- 

J 


(2.18) 
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The  coefficients  Cj  may  depend  on  k and  t either  directly  or  through  a depen- 
dence on  u.  Then  |u(t)|i  is  a non i ncreas i ng  function  of  t if  and  only  if  for 
all  h and  all  t 


c0(h> 

+ i |c  j (h+j ) | < 0, 
j*o  J 

(2.19) 

Proof : 

Define  s (t)  by 

Then 

s (t) 
k 

= signum  u^(t) 

(2.20) 

Since  each 

M+>  1,  = 1 sk(+)  uk(t)  . 
k 

sk  is  piecewise  constant. 

(2.21) 

H , du. 

dflu(+)  1,  = l sk<t)  • 

(2.22) 

According  to  equation  (2.17) 


dt 

Setting  this  into  the  right 


t 


= l c . (k)  u . (2.23) 

j J k~J 

in  (2.22)  we  get,  after  relabeling  the  index  of 


summation, 

^lu<t,l.  \ i °j<k>  Vj  * l "h  uh  (2-24) 

k j J J h 

where 

"h  - i cj(h+J)  sh+j  • <2-25) 

J 

Suppose  uh  * 0;  then  by  (2.20),  sh  * 0.  Multiply  (2.25)  by  sh;  using  assump- 
tion (2.19)  we  get 


sh  "h  = co<M  + |0cj(h+J)shVj  4 0 • 

Since  by  definition,  s^  and  uh  have  the  same  sign,  it  follows  that  for  all  h 


Uh  Wh  4 °! 
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this  relation  clearly  holds  also  when  uh  = o.  Setting  this  into  (2.24)  we 
obtain 

dtlu(+) I,  < 0 ; 

this  proves  that  | u ( t ) | ^ decreases  as  t increases. 

Next  we  indicate  why  condition  (2.19)  is  necessary.  Suppose  it  is  violated 
at  to,  hQ.  Let  u(t)  be  that  solution  of  (2.17)  whose  value  at  to  equals 


Using  (2.23)  we  get 


uk(t0)  = $k,h0 


for  k = hg 
for  k * ho 


Uk(t0 


+ e)  = 6 


. . + e l c . (k)6  . . 

k,hn  ; J k-j,h 


+ 0(e2) 


Summing  with  respect  to  k gives 

i u (t  + e)  = 1 t e i c.(hn  + j)  + 0(e2). 
k k 0 j J ° 


Since  condition  (2.19)  is  violated  at  tQ,  ho  we  conclude  that  for  e small  enough 
pos i t i ve , 


Since 


l uk(tQ  t e)  > 1 . 


Iu(t0+  e) h > ^ uk(+0  + e) 

k 

while 

|ut+0)  |,  = 1, 

this  shows  that  J u ( t ) | -(  is  not  a decreasing  function  of  t,  completing  the  proof 
of  Theorem  C. 
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3.  Construction  of  Total  Variation  Diminishing  Schemes 


Theorems  A,  B,  and  C may  be  used  to  find  conditions  on  the  coefficients  of  a 
difference  operator  which  guarantee  that  the  total  variation  of  a solution  does 
not  increase  for 

E)  explicit  schemes 
I)  implicit  schemes 
S)  semi -discrete  schemes. 


The  total  variation  of  a vector  u is 


TV<U>  -\K-  vii  • 


Using  the  right  shift  operator  T: 

T(u,k  = v, 


we  can  express  TV(u)  as 


TV(u)  = |(l-T)u|  . 


(3.1) 


We  turn  now  to  explicit  (2J+1)  point  schemes 


n+1  p.,  n. 

u = D(u  ) 


(3.2) 


where 


D(u)  = l d . (k)u  . . 


-J 


. \ / Vi, 

J k-j 


(3.3) 


We  assume  that  the  difference  operator  D preserves  constants.  In  view  of  (3.3), 
this  is  the  case  if 


l d . (k)  = 1 


(3.4) 


for  all  k.  Schemes  (3.3)  satisfying  this  condition  can  be  written  in  the  form 


D(u) 

k 

or  in  operator  notation 


+ l 


e.(k)(u,  .-u.  . ,) 

J k-j  k— j — 1 


(3.5) 
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(3.6) 


D = I + E(l-T), 

where 

E = I e.  TJ  . (3.6)* 

L J 

We  want  to  find  conditions  which  guarantee  that  D is  TVD,  i.e.,  satisfies  for 
all  u 


TV (Du)  < TV ( u ) . 


Using  formula  (3.1)  this  is  the  same  as 

|(l-T)Du|1  < | ( l-T)u  | , . 
Using  formula  (3.6)  we  can  write 


where 


(l-T)D  = ( l+( l-T)E)( l-T)  = A(l-T), 
A = I + ( l-T) E. 


(3.7) 


(3.7)* 


(3.8) 

(3.8) * 


We  now  set  (3.8)  into  (3.7)*;  denoting 


( l-T)u  = u* 

we  obtain  the  equivalent  inequality 

I Au*  | y < |u*|,  . 


(3.9) 


This  is  certainly  the  case  if  A is  an  contraction, 
in  Section  2 the  criterion  (2.5): 

l |a . (h+j ) | < 1 
j J 

where 

(Au>r  » I a.<k,Vj  . 


for  which  we  have  derived 


(3.10) 


It  follows  from  (3.8)*  that  the  coefficients  aj  of  A can  be  expressed  in  terms 
of  the  coefficients  ej  of  E as 
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aQ(k)  = 1 + eQ(k)  - e_1(k-1) 

and 

a (k)  = e (k)  - e (k-1),  j*0. 

J J J ' 

It  follows  from  these  relations  that 

L a (h+j)  = 1; 
j J 

but  then  (3.10)  can  hold  if  and  only  if  for  ail  j and  k 

a . (k)  > 0. 

J 

Using  (3.11),  (3.11)*  we  can  express  this  condition  as  follows: 

e_ 1 ( k- 1 ) > e_2(k-2)  > ...  >e_J(k-J)  > 0, 

-eQ(k)  > - e^k+1)  >...>  - eJ_1(k+J-1)  > 0, 

1 + eQ(k)  - e_1 (k-1)  >0. 

Thus  we  have  proved 

Theorem  E:  The  explicit  scheme  (3.3)  is  TVD  if  conditions  (3.12) 

satisfied  for  all  k,  where  ej  are  the  coefficients  appearing  in  formula  (3 
for  D. 


We  turn  next  to  implicit  schemes: 

c,  n+1.  n 
F(u  ) = u 

We  take  F to  be  a 2J+1  term  difference  operator  that  preserves  constants, 
an  F can  be  written  in  the  form 

F = I + G( l-T) 


where 


(3.11) 

(3.11) * 


(3.12) 


are 

.5) 


(3.13) 
Such 

(3.14) 
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(3.14)* 


G<u»k  = 1 g.(K,  Vj 

We  want  to  find  conditions  under  which  scheme  (3.13)  is  TVD,  i.e.,  for  all  u 

TV(Fu)  > TV(u)  . (3.15) 

Using  formula  (3.1)  this  is  the  same  as 

| ( l-T) Fu | > | ( l-T)u  | j . (3.15)* 

Using  formula  (3.14)  we  can  write 

( I — T ) F = (l  + (l-T)G)(l-T)  = B(l-T)  (3.16) 

where 

B = I t ( I — T ) G.  (3.16)* 

We  set  (3.16)  into  (3.15)*;  denoting 

( l-T)u  = u* 

we  obtain  the  equivalent  inequality 

|Bu*  | j > |u*  1 1 • (3.17) 

This  is  the  case  if  B is  an  expansion.  In  theorem  B we  have  derived  criterion 

(2.12)  that  guarantees  that  an  operator  B is  an  expansion: 


b (h)  > l |b.(h+j)  | + 1.  (3.18) 

0 j*0  J 

It  follows  from  (3.16)*  that  the  coefficients  bj  of  B can  be  expressed  in 
terms  of  the  coefficients  gj  of  G as 


bQ(k)  = 1 + gQ(k)  - g_ 1 (k-1 ) 

and 

b . (k)  = g (k)  - g . ,(k-1),  j*0 
J J J 

Adding  up  these  relations  we  deduce  that 

bn(k)  = 1 -l  b (k+j); 

0 j*0  J 


(3.19) 
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but  then  (3.18)  can  hold  if  and  only  if  for  all  k and  for  j + 0 

bj(k)  < 0. 

Using  (3.19)  these  conditions  can  be  restated  as 

gQ(k)  > g 1 ( k+ 1 ) >...  >gJ_1(k+J-1)  > 0 (3.20) 

and 

-g_](k-1)  > -g_2(k-2)  >...>-g_J(k-J)  > 0.  (3.20)* 

Thus  we  have  proved 

Theorem  I : The  implicit  scheme  (3.13)  is  TVD  if  conditions  (3.20),  (3.20)* 

are  satisfied,  where  gj  are  the  coefficients  of  the  operator  G related  by  formula 
(3.14),  (3.14)*  to  the  operator  F appearing  in  (3.13). 

We  remark  that  we  can  combine,  as  Harten  does,  theorems  I and  E to  study 
implicit-explicit  schemes  of  the  form 

n+1.  rw  n. 

F(u  ) = D(u  ).  (3.21) 

Such  a scheme  is  TVD  if  F satisfies  the  conditions  of  Theorem  I and  D the  con- 
ditions of  Theorem  E. 

Finally  we  turn  to  semi -discrete  schemes: 

du 

= Hu/  (3.22) 

with  H some  2J+1  point  difference  operator.  We  assume  that  u = const  is  a solu- 
tion of  (3.22);  this  is  the  case  if  H annihilates  all  constant  vectors.  In  this 
case  H can  be  written  in  the  form 


H(u)k  = 1 i>j<k>(Vj-  Vj_,>  . (3.23) 

or  in  operator  form 

H = M( l-T)  . (3.23)* 


We  want  to  find  conditions  on  H which  guarantee  that  TV(u)  is  a decreasing  func- 
tion of  t for  all  solutions  u of  (3.22).  By  formula  (3.1),  this  is  the  same  as 
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|( l-T)u(t) |, 

being  a decreasing  function  of  +.  So  we  multiply  (3.22)  by 
we  get 


where 


Denot i ng 


-(|-T)u 


(l-T)  M( l-T)u  = C(l-T)u 


C=( l-T)M. 


( l-T)u=u* 


(3.24)  becomes 

U*  = Cu*  . 
at 

According  to  Theorem  C,  |u*|i  is  a decreasing  function  of  t 
of  Section  2 is  satisfied: 

c.(k)  + l |c  .(k+j)  | 4 0 . 

U j*0  J 

Using  (3.25)  we  can  express  the  coefficients  cj  in  terms  of 
fol lows: 

c . (k)  = m . (k)  - m . , (k- 1 ) . 

J J J-1 

Thus 

l c (k+j)  = 0; 

j J 

It  follows  from  this  that  (3.26)  can  hold  if  and  only  if 

c . ( k+j ) > 0 , j t 0 . 

J 

Using  (3.27)  we  can  restate  this  as 


(l-T);  using  (3.23)* 

(3.24) 

(3.25) 

if  condition  (2.19) 

(3.26) 

those  of  M as 

(3.27) 
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and 


m_1(k-1)  > m_2(k-2>  > . . •>m_J (k-J)  >0 


(3.28) 


— mQ ( k ) > - rr^Ck+1)  >...  > mJ_1(k+J-1)  > 0 . (3.28)* 

Thus  we  have  proved 

Theorem  S:  The  semi-discrete  scheme  (3.22)  is  TVD  if  conditions  (3.28)  and 
(3.28)*  are  satisfied,  where  the  mj  are  the  coefficients  of  the  operator  M 
related  by  formula  (3.23)*  to  the  operator  H. 
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4.  Conclusion 


The  conservation  law  (1)  describes  a right  running  wave  when  a(u)  is  posi- 
tive. Conditions  (3.12)  and  (3.28)  of  Theorems  (E)  and  (S)  state  that  the 
explicit  and  semi-discrete  schemes  (E)  and  (S)  are  TVD  if  and  only  if  the  coef- 
ficients of  the  differences  u^_j  - u^-j-i  have  the  same  sign  as  a(u)  for  j > 0, 
(points  on  the  upwind  side),  and  the  opposite  sign  for  j < 0,  (points  on  the 
downwind  side).  If  the  differences  are  moved  over  to  the  right  of  equation 
(3.13),  then  condition  (3.20)  of  Theorem  ( *1 ) states  that  the  implicit  scheme  (I) 
will  be  TVD  if  it  satisfies  a similar  condition  on  the  sign  of  its  coefficients. 
In  all  three  cases  only  the  differences  on  the  upwind  side  have  the  correct  sign 
for  consistency  with  (1),  and  can  contribute  to  wave  propagation  in  the  correct 
direction.  In  this  sense  upwind  biasing  is  a necessary  feature  of  explicit  TVD 
schemes,  and  it  is  also  useful  in  the  construction  of  implicit  TVD  schemes. 

It  is  thus  not  surprising  to  find  that  most  of  the  attempts  to  design  schemes 
with  the  capability  of  capturing  shockwaves  and  contact  discontinuities,  dating 
back  to  the  early  work  of  Courant,  Isaacson  and  Rees  [8i,  and  Godunov  [91,  have 
introduced  upwinding  either  directly  or  indirectly.  Second  order  accurate 
upwind  schemes  have  been  devised  by  Van  Leer  [10],  Harten  [4],  151,  Roe  111], 
Osher  and  Chakravarthy  112],  and  Sweby  113).  These  all  use  flux  limiters  to 
attain  the  TVD  property. 

Another  approach  to  the  construction  of  TVD  schemes  stems  from  the  obser- 
vation that  central  difference  formulas  for  odd  and  even  derivatives  have  odd 
and  even  distributions  of  signs,  and  they  can  be  superposed  and  combined  with 
flux  limiters  to  satisfy  conditions  (3.12)  or  (3.28).  Upwind  biasing  is  then 
produced  indirectly  by  cancellation  of  terms  of  opposite  sign.  One  possible 
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starting  point  for  such  a construction  is  a central  difference  scheme  in  which 
the  numerical  flux  1 /2 ( f j + 1 + f j ) is  augmented  by  a third  order  dissipative 
flux.  This  scheme  is  the  basis  of  a method  which  has  been  widely  used  to  solve 
the  Euler  equations  of  compressible  flow  [141.  It  can  be  converted  into  an 
attractively  simple  TVD  scheme  by  the  introduction  of  flux  limiters  in  the 
dissipative  terms  1151.  The  modified  numerical  flux  retains  a symmetric  distri- 
bution of  terms  about  the  cell  boundary  j + 1/2.  The  resulting  symmetric  scheme 
is  one  of  the  variants  of  a class  of  symmetric  TVD  schemes  recently  proposed  by 
Yee  116).  Her  derivation  follows  an  entirely  different  line  of  reasoning, 
building  on  the  work  of  Davis  (171,  and  Roe  [181.  In  comparison  with  upwind  TVD 
schemes,  symmetric  TVD  schemes  offer  a significant  reduction  of  computational 
complexity,  while  exhibiting  comparable  shock  capturing  capabilities. 
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ABSTRACT 

We  continue  the  construction  and  the  analysis  of  essentially  nonosdllatory  shock  capturing 
methods  for  the  approximation  of  hyperbolic  conservation  laws.  These  schemes  share  many  desirable 
properties  with  total  variation  diminishing  schemes,  but  TVD  schemes  have  at  most  first  order  accu- 
racy in  the  sense  of  truncation  error,  at  extrema  of  the  solution.  In  this  paper  we  construct  an  hierar- 
chy of  uniformly  high  order  accurate  approximations  of  any  desired  order  of  accuracy  which  are 

tailored  to  be  essentially  nonosdllatory.  This  means  that,  for  piecewise  smooth  solutions,  the  variation 
of  the  numerical  approximation  is  bounded  by  that  of  the  true  solution  up  to  0(hR~‘),  for  0 < R 

and  h suffidently  small.  The  design  involves  an  essentially  non-osdllatory  piecewise  polynomial 

reconstruction  of  the  solution  from  its  cell  averages,  time  evolution  through  an  approximate  solution 

of  the  resulting  initial  value  problem,  and  averaging  of  this  approximate  solution  over  each  cell.  To 

solve  this  reconstruction  problem  we  use  a new  interpolation  technique  that  when  applied  to  piecewise 

smooth  data  gives  high-order  accuracy  whenever  the  function  is  smooth  but  avoids  a Gibbs 

phenomenon  at  discontinuities. 
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I.  INTRODUCTION 


In  this  paper  we  consider  numerical  approximations  to  weak  solutions  of  the  hyperbolic  initial 
value  problem  (IVP) 


u,  + /(«)*  = 0 = u,  + a{u)ux 


(1.1a) 


m(x,0)  = uG(x)  . 


(1.1b) 


Here  u and  / are  m vectors,  and  a(u ) = df/du  is  the  Jacobian  matrix,  which  is  assumed  to 
have  only  real  eigenvalues  and  a complete  set  of  linearly  independent  eigenvectors. 

The  initial  data  «0(x)  are  assumed  to  be  piecewise-smooth  functions  that  are  either  periodic  or 
of  compact  support. 

Let  v]  - vh(xj,tn),  Xj  = jh,  tn  = na,  denote  a numerical  approximation  in  conservation  form. 

v"+1  = VJ  ~ ~ h-v 2)  = (Eh  ■ vn)j . (1.2a) 

Here  Eh  is  the  numerical  solution  operator,  X = r/h,  and  fJ+V2,  the  numerical  flux,  is  a 
function  of  2k  vector  variables: 

fj+V2  = /(v"-k+l— v"+fc)  (1.2b) 

which  is  consistent  with  (1.1a)  in  the  sense  that 

/(«, «,...,«)  =/(«)  . (1.2c) 


We  shall  also  consider  a semi-discrete  method  of  lines  approximation  to  (1.1a)  obtained  by  divid- 
ing (1.2a)  by  t and  letting  a i 0 


_ 1 ^ _ (fo  “ 7)  • v)i 

- ~ “ Jj-vv Xt 


dv, 

dt  h 


(1.3) 


with  fj+-,j2  again  satisfying  (1.2.b,  c). 

The  numerical  approximation  in  (1.2)  is  considered  to  be  an  approximation  to  the  cell  average  of 


u: 
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(1.4) 


Accordingly  we  define  its  total  variation  in  x to  be: 

TV(v”)  = TV(vA(-,r„))  = 2jvy"+1  “ v]\  (1.5) 

J 

where  | | denotes  any  norm  on  Rm. 

If  the  total  variation  of  the  numerical  solution  is  uniformly  bounded  in  h,  for  0 ( s r, 

TV(v*(y)  ^ CTV(Mo)  , (1.6) 

then  any  refinement  sequence  h - 0,  t = 0(h)  has  a subsequence  h}  - 0 such  that 

v*,  - « (1.7) 

where  u is  a weak  solution  of  (1.1). 

If  all  limit  solutions  (1.7)  of  the  numerical  solution  (1.2)  satisfy  an  entropy  condition  that  implies 
uniqueness  of  the  I.V.P.  (1.1),  then  the  numerical  scheme  is  convergent  (see,  e.g.  [5],  [14]). 

For  the  semi-discrete  approximation,  (1.3),  we  consider: 

v/(0  ~ T J/+W  «(*>0d*  • (1.8) 

ft  V-V2 

' The  analogous  statements  concerning  TV  and  convergence  are  valid  as  well  in  this  case,  see, 
e.g.  [12]. 

We  shall  now  concentrate  on  the  scalar  case,  m = 1.  Extensions  to  systems  will  be  discussed  in 
sections  HI  and  V. 

Recently  total  variation  diminishing  (TVD)  schemes  have  been  designed  and  analyzed  [5],  [6]. 
There  the  approximate  solution  is  required  to  diminish  the  total  variation  (1.5)  of  the  numerical  solu- 
tion in  time: 

TV(vA(-,ti))  =s  TV(v„(-,/2)  if  h > h • (1-9) 
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These  schemes  trivially  satisfy  (1.6)  with  C = 1. 

We  were  able  to  construct  TVD  schemes  that  in  the  sense  of  local  truncation  error  are  of  high- 
order  accuracy  everywhere  except  at  local  extrema  where  they  necessarily  degenerate  to  first  order 
accuracy  (see  [5],  [6],  [12],  [14],  [15],  [17]).  The  perpetual  damping  of  local  extrema  determines  the 
cumulative  global  error  of  the  "high-order  TVD  schemes”  to  be  0(hl+iJP)  in  the  Lp  norm.  This 
improves  by  one  order  in  steady  state  calculations,  see  [1]. 

In  a sequence  of  papers  of  which  this  is  the  second,  we  show  how  to  construct  essentially  non- 
osdllatory  schemes  (ENO)  that  are  uniformly  high-order  accurate  (in  the  sense  of  global  error  for 
smooth  solutions  of  (1.1))  to  any  finite  order. 

In  the  first  paper  [7]  we  constructed  a uniformly  second-order  accurate  scheme  which  is  non- 
osdllatory  in  the  sense  that  the  number  of  local  extrema  in  the  numerical  solution  is  non-increasing. 
Unlike  TVD  schemes,  which  also  have  this  property,  members  of  this  dass  are  not  required  to  damp 
the  values  of  each  local  extremum  in  time,  but  are  allowed  occasionally  to  accentuate  a local 
extremum. 

In  this  paper  the  schemes  (1.2)  are  constructed  to  be  essentially  non-osdllatory.  Our  goal  is 
that,  if  the  initial  data  u0(x)  are  piecewise  smooth,  then  for  h suffidently  small 

TV(v„(-,r  + At))  * TV(vn(-,0)  + 0(*V+1)  (1.10) 

where  N is  the  order  of  accuracy  of  (1.2).  This  implies  that,  at  each  time  step,  the  scheme  is  non- 
osdllatory  modulo  0(/rv+l). 

The  format  of  this  paper  is  as  follows.  In  section  II  we  shall  give  the  design  prindple  and  over- 
view of  the  present  method,  including  comparisons  with  TVD  schemes.  Section  in  consists  of  certain 
variants  and  extensions  of  the  scheme  induding  extensions  to  systems  and  to  regions  with  boundaries. 
Section  IV  gives  the  interpolation  algorithm,  which  is  the  crux  of  the  method,  along  with  the  key 
result  - Theorem  (4.1).  Several  examples  are  also  given.  Section  V gives  further  analysis  of  the  inter- 
polation method  and  an  example  showing  that  general  non-osdllatory  schemes  need  additional  proper- 
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ties  (which  we  believe  to  be  true  for  the  present  methods)  to  guarantee  convergence.  We  also  analyze 
the  truncation  error  of  our  methods  in  this  section.  Proofs  of  some  technical  results  are  given  in  an 
Appendix.  We  refer  the  reader  to  references  [24]  and  [25]  for  numerical  results  using  these  methods. 


n.  Design  Principle,  Overview,  and  Examples. 

In  this  section  we  describe  how  to  construct  ENO  schemes  of  any  desired  accuracy. 

Integrating  the  partial  differential  equation  (1.1a)  over  the  computational  cell 

(*r-!/2.  X ('„>  we  get 

*T'  ~ “T  ~ " h- «(«)!  . (2.1a) 

where 

fj+ii(u)  “ “*  /(M(*y+i/2»0)^  (2.1b) 

and 

“7  = T /r  ;+W  u(x,tn)dx  . (2.1c) 

n v-w 

We  shall  also  be  interested  in  a semi-discrete  approximation  to  (1.1),  so  we  divide  (2.1a)  by  r 
and  let  t i 0: 

3 T = ~ !/(“(■*/+ 1/2.O) 

dt  J h ’ 

where  again 

«/  = 7 J (2.2b) 

n v-i n 

We  observe  that  although  (2.1a)  is  a relation  between  cell-averages  uj  and  Uj+l,  the  evalua- 
tion of  the  fluxes  fj+y2(u)  in  (2.1b)  requires  knowledge  of  the  solution  itself,  not  its  cell  averages. 

As  in  Godunov’s  scheme  [4]  and  its  second  order  extensions  [20],  [2],  we  derive  our  scheme  as  a 
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direct  approximation  to  (2.1).  We  denote  by  v"  the  numerical  approximation  to  the  cell  averages  uj 
of  the  exact  solution  to  (2.1)  and  set  vj  to  be  the  cell  averages  of  the  initial  data.  Given  v"  = {v"}, 
we  compute  vn+‘  as  follows: 

First  we  reconstruct  u(x,fn)  out  of  its  approximate  cell-averages  {vj1}  to  the  appropriate  accu- 
racy and  denote  the  result  by  L(x;vn).  Next  we  solve  the  IVP: 

v,  + /(v),  = 0,  v(x,0)  = L(xy)  (2.3) 

and  denote  its  solution  by  v(x,r).  Finally  we  obtain  v"+1  by  taking  cell  averages  of  v(x,t): 

vyfl+1  = \ J v(x,t )dx  . (2.4) 

The  averaging  procedure  is  TVD,  as  is  the  exact  solution  operator.  We  may  conclude,  there- 
fore, that  the  design  of  ENO  high  order  accurate  schemes  boils  down  to  a problem  on  the  level  of 
approximation  of  functions:  that  of  constructing  an  essentially  non-osdllatory  high-order  accurate 
interpolant  of  a piecewise  smooth  function  from  its  cell  averages. 

In  section  IV  we  shall  construct  an  essentially  non-osdllatory  piecewise  polynomial  of  order 
V,  Qv(x;w)  that  interpolates  a piecewise-smooth  function  w(x)  at  the  cell  interface  points: 

Q^(xj+V2’w)  = w(xj+v  2)  (2.5a) 

and.satisfies,  wherever  w(;c)  is  smooth 

f“  Qw(x  ± 0 ;«)  = w(x)  + O(frv+1~0,  r = 1,.„A  • (2.5b) 

The  key  result,  contained  in  Theorem  (4.1)  in  section  IV  below,  is  the  following.  For  any  piece- 
wise  smooth  function  tv(x),  there  exists  an  /i0>0  and  a function  z(x),  such  that  for  0<As  hQ: 

0s  (x^)  = z(x)  + 0(/rv+1)  (2.6a) 

TV(z)  s TV(w)  . (2.6b) 
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We  shall  use  this  polynomial  together  with  two  different  approaches  to  design  ENO  schemes. 
These  methods  are: 

RP:  Reconstruction  via  the  primitive  function. 

RD:  Reconstruction  via  deconvolution. 

We  begin  with  RP.  Let  W(x)  be  the  primitive  function  of  u(x) 

W(*)  = JT*  «(*)<&  . (2.7) 

The  lower  limit  shall  play  no  role  in  what  follows,  so  we  choose  it  to  be  a = jc_V2,  for  simplicity 
of  exposition.  Thus  since  we  wish  to  reconstruct  u(x)  out  of  its  approximate  cell  averages  vj  (drop- 
ping the  t or  n dependence)  we  have  an  approximation  to  W(x,+y2) 

J 

^+1/2)  = 2 V»  • (2.8) 

4-0 

In  each  cell  Ij\{x/xj_  1/2  ^ x < x]+  1/2} » (x;w)  is  a polynomial  of  degree  N which  interpolates 

w(xj+ L'2);  i.e.,  for  all  j 

QW(xj+V2'>w)  = w(*j+v 2)  • (2.9) 

Thus  Qv(x,w)  is  a continuuous  piecewise  polynomial,  and  both  of  d/dx  Qw(x  ± 0;w)  are  globally 
well  defined. 

Our  approximation  to  (1.1)  can  be  obtained  by  solving  (2.3)  with 

v(x,iv)  = d/dx  Qs(x;wn)  = L(x;vn)  , 

obtaining  v(x,t),  flstsr  and  then  computing  cell  averages  (2.4).  This  can  be  rewritten,  using  the 
divergence  theorem,  as: 

v/+1  - vj  — \(fj+ ]/2  ~ fj-v 2)  , (2.10) 

since 

QN(x]+V2;wn)  - QN(x/_v2;wn) 

— 1 — = vy" 
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because  of  (2.5a)  and  (2.8). 

Here  fj+  v2  is  computed  by  averaging  the  flux  function  /(«)  applied  to  v(xj+V2,t)  as  in 
(2.1b). 

In  the  linear  case: 

u,  + aux  = 0 ; (2.11) 

this  procedure  is  easy  to  carry  out.  The  exact  solution  to  IVP  (2.3)  is 

v(x,f)  = L(x  - af;v")  = Qw(x  - af;wn)  ; (2.12) 

thus  the  scheme  becomes 

vj  + 1 = (Eh  ■ v")y  = v;  - X(#+V2  - ]j-^)  = (2.13a) 

= v]  - j([G,V(^+L'2;m"’]  - £V(*y+ vi  ~ aW*)] 

-[Q'l(xj_y2,wn)  - Q^ixj. y2  - aT;wn)]| 

= \[Q?(x)+\n  ~ aT*»w")  ~ QS(X)~  1/2  ~ aT;w")] 
given  the  CFL  restriction^1) 

|aX|  =s  1 . (2.13b) 

The  numerical  flux  functions  fj+y2  defined  here  involve  values  of  £>v(x;vn)  for  x between 
Xj-V2  311(1  X]+V2  a > 0,  or  xJ+v2  and  xJ+V2  if  a < 0.  Thus,  unsurprisingly  the  resulting 
scheme  has  an  upwind  bias. 

For  general  /(«)  the  explicit  solution  to  (2.3)  can  be  difficult  to  obtain.  However,  for  N — 1, 
the  initial  data  are  piecewise  constant: 

(1The  restriction  can  be  easily  removed  in  this  constant  coefficient  case. 
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Thus  the  scheme  becomes: 


L(x;vn)  - v’j,  Xj.V2  Si<  xJ+V2  . 


v"+1  = v;n  - \[f(v(xJ+v2)  - f(v(xj_y2)]  , (2.14a) 

where  v(xy+2/ 2)  a V(xy+v2>0>  f°r  0 < t ^ t,  if  the  CFL  restriction 

|x/'(«)|  < 1 , (2.14b) 

for  all  u such  that:  min(v",vjI+1)  ^ u s max(vy  ,Vy+1),  is  satisfied. 

This  is  precisely  Godunov’s  scheme  [4],  which  is  the  canonical  three  point,  upwind,  first  order 
accurate  method  [9].  Thus  our  higher  order  methods  are  simply  generalizations  of  Godunov’s  tech- 
niques to  higher  order  ENO  schemes.  The  first  higher-order  TVD  (although  the  concept  was  not  yet 
defined)  Godunov  type  method  was  introduced  by  van  Leer  [20],  See  [8],  [2],  and  [20]  for  theoretical 
and  practical  results  concerning  such  TVD  methods.  The  difference  here,  of  course,  is  that  we  allow 
our  interpolant  to  be  arbitrarily  high-order  accurate  even  at  extrema,  and  we  replace  the  restrictive 
TVD  condition  [6],  [10],  by  the  ENO  property. 

A key  step  in  this  method  comes  in  solving  to  the  Riemann  problem  (1.1a,  b),  with  initial  data 
consisting  of  two  constant  states 

k(x,0)  ■ uL,  x ^ 0 


a(x,0)  - uR,  x > 0 . 

The  unique  entropy  condition  satisfying  similarity  solution  was  obtained  in  [9].  The  resulting  scheme 
(2.14a)  can  be  written: 

v/+1  = v/  - \(/f+v2  - ff.^)  (2.15a) 

where 


fj+vi  ~ fG(vj>vj+ 1)  — 


[ min/(«) , if  vj  vy+1 
max/(u)  , if  vy  > vJ+1 


(2.15b) 
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The  corresponding  semi-discrete  approximation  is  just: 


(2.16) 


Although  the  high-order  explicit  method  described  above  can  have  a complicated  flux  function, 
its  semi-discrete  limit  is  much  simpler.  We  merely  take  limits  as  in  (2.2a)  and  arrive  at 

jj  V,  = tfVw  - Opf),  £ 0V(*,.W  + 0;w))j  (2.17) 

-j°  {£  (Pty-n  - 0;h."),  £ e»(i/-ia  + Op**)]) 

i.e.,  Godunov’s  method  with  more  accurate  constant  initial  states. 

Next  we  use  RD.  This  time  we  begin  with  u(x)  and  denote  by  u(x)  its  mean  over 
(x  — h/2,  x + hi 2),  i.e., 

u(x)  = j JT*_^2  u(y)dy  = u(x  + sh)ds  . (2.18) 

Denote  by  uj  = u(xj),  the  cell-averages  of  u(x). 

Again,  given  cell  averages  vj  which  approximate  Uj  we  wish  to  reconstruct  u(x)  up  to 
0(/rv+i)  in  an  essentially  non-oscillatory  way.  Here  we  again  begin  by  constructing  a piecewise  poly- 
nomial interpolant  of  order  N,  which  we  again  call  (2v(*;v)»  that  interpolates  v at  x,  for  each  j: 

Q\xj)v)  = Vj  . (2.19) 


This  time  Qw(x;v)  is  a polynomial  of  degree  N in  the  interval  xj  s x < xj+u  with  possible 
jump  discontinuities  in  derivatives  at  the  end  points.  Then  we  compute  an  essential  non-oscillatory 
piecewise  polynomial  of  degree  N — 1 as  follows: 

.v-i  , 


F ’(xjv)  = vj  + 2 -r(*  ~ XjY 

r-  1 rt  • 


(2.20) 


m 


[(*)"  Q^x> " 0;v)’ 


£ cvct7 + °'v) 
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defined  for 


XJ-V2  ^ x S X1~'J2 


Here  m is  the  rain  mod  function: 


(s  min(lx|,ly|)  i 
= |0  o 


if  sgnx  = sgny  = s 
otherwise 


(2.21) 


This  gives  us  our  approximation  to  v,  which  may  have  discontinuities  at  each  Xj+in-  We  use 
this  to  obtain  an  approximant  to  m(x)  via  a "deconvolution"  procedure.  We  have  approximate 
derivatives  to  «(x)(2): 


[hi ] awu, = W,  - <w. 


[i]  + °-v)) 


+ 0(Ar+1),  r = 0,1,...  A - 1 . 


At  points  of  smoothness,  we  have 

(*£)  = /-L-2  (‘f ) + ^ 


N-k- 1 , ( , )fc+r  1/2 


for  ifc  = 0,1,...,#  — 1. 

Thus  we  may  write  the  ToSplitz  upper  triangular  matrix  equation: 


(2.22) 


(2.23) 


(2)This  will  be  shown  for  piecewise  smooth  u(x)  in  section  IV. 
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«(*/) 

hit  “(xj) 


(2.24) 


This  is  easily  inverted  and  gives  us  each  of  the  terras  ( hdldxf  u(xj),  up  to  OQrN). 

We  replace  the  left  side  of  (2.24)  by  the  approximations  on  the  corresponding  right  side  of 
(2.22)  for  each  v.  We  invert  this  system  in  (2.24)  and  call  the  computed  approximate  values 
(hvd/dx)vv(x}). 

For  xj.v2  s x < xJ+V2,  we  write  our  approximation  as 

We  need  the  following: 

LEMMA  (2.1) 

The  cell  average  is  preserved  under  this  operation,  i.e.: 

JL  J/+ W L.v- !(.*;„)<&  = u (2.26) 

n 
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Proof 


A direct  computation  gives  us: 


i £*->(**)&  = '2‘ 

n v-w  ■ v-0 


/- 

> 

' N. 
> 

•4 

i 

1 - (-l)v+1 

K'dx) 

2v(v  + 1)! 

2 

= Ut 


from  the  first  row  of  (2.24). 

Now  we  confine  our  scheme  construction  as  we  did  using  RP.  In  the  RD  approach  we  approxi- 
mate (1.1)  by  solving  (2.3)  with  v(r,0)  = Z,,v-1(x;0)  = L(x;vn)  and  proceed  as  above.  We  again 
arrive  at  (2.10).  In  the  linear  case  (2.11)  the  resulting  numerical  fluxes  are  defined  via 

fj+V2  = a Iq  lS~K*]+V2  ~ asr;vn)ds  , (2.27) 

given  the  CFL  restriction  (2.13b). 

Also  the  semi-discrete  algorithm  for  general  / obtained  via  RP  in  (2.17)  is  replaced  by  its 
analogue  with  the  numerical  flux 

fG(Lrf~1(xj+V2  - 0),  Ln~1(xj+v2  + 0))  . (2.28) 


III.  Variants  of  the  Scheme. 

The  exact  solution  to  the  special  initial  value  problem  (2.3)  can  be  difficult  to  compute.  This  is, 
of  course,  particularly  true  when  the  initial  data  is  a piecewise  polynomial  of  degree  higher  than  zero, 
but  is  also  usually  true  for  general  systems  of  equations  for  piecewise  constant  initial  data,  i.e.,  for 
Godunov’s  method.  One  can,  however,  obtain  a convergent  power  series  expansion  for  this  solution 
see  [22],  [23]. 

Godunov’s  method  is  canonical  in  the  class  of  (scalar)  E schemes,  defined  in  [9].  A consistent 
numerical  flux  yields  a semi-discrete  E scheme  iff 

[sgn(uy+i  - uj)]fj+v2  £ [sgn(uy+1  - My)lf+V2  , (3.1) 

or  equivalently,  iff  its  viscosity  is  greater  than  or  equal  to  that  of  Godunov’s  method  [18].  E schemes 
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are  TVD  and  entropy  condition  satisfying;  thus  they  always  converge  to  the  correct  physical  solution 
[19],  [18].  Examples  include  the  Engquist  - Osher  scheme  and  entropy  corrections  of  Roe’s  scheme  - 
see,  e.g.  [3],  [16]. 


One  property  all  E schemes  share  is  the  fact  that  they  can  be  obtained  by  averaging  a solution  to 
a Rieraann  problem  over  each  cell,  where  / is  replaced  by  an  approximation  /,  in  equation  (1.1)  - 
see  [18],  [14].  Thus  they  retain  the  ENO  property.  We  may  let  ff+V2  = fE(vJ,vJ+l)  be  any  two 
point  E flux  and  generalize  our  semi-discrete  algorithm  (2.17)  to: 


-1 

h 


\f 


&w(xj+ 1/2  - °;  O.  Qw(xj+y 2 + 0; »") 


/ 

~f  (Fty-vi  ~ 0;  O.  j;  Q\*j-v i + 0;  w») 


(3.2) 


We  may  generalize  (2.28)  analogously. 

Next  we  replace  the  exact  numerical  flux: 

fj+irz  ~ ~ f0  f(v(xj+ V2>  0)<*  (3.3) 

by  an  approxunation  based  on  a Taylor  series  as  follows. 

For  *y_i/2  < x < Xj+v29  we  can  compute  the  quantities 

(£)'VM) 

for  v the  solution  of  (2.3),  by  using  a Lax-Wendroff  type  of  procedure. 

For  example: 

f M)  = ^/(vM)) 


dxdt 


-/'(«(*. 0))(«ft(x,0))2  - f (u (x ,0))«xt (x ,0) 
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(3.4) 


= -nu(x.O))^  (x,0)%(xfi)  - ffy(x,0))  . 


Next  we  write  an  approximation  to  v(x,r): 


v*(*,0  = v(x,0)  + r (x,v)+-+  jf 


Now  we  replace  the  integral  in  (3.3)  by  a quadrature  rule 


/ /(v(xj+v2;  os)ds  ~ A0f(y(xJ+^,s^))  +•••+  Ak  f(y(xJ+v2,sk)) 


(3.5) 


for  0 s j0  < s^-’  < sk  s 1 . 

Finally,  we  define  each  value  of  / above  as: 

/(v(JC7+l/2>,rr))  = fG(v(,xj+lf 2 “ 0)Jr)>  v(x/+V2  ®>^r))  (3.6a) 

if  we  base  our  approach  on  Godunov’s  method.  More  generally  we  can  replace  Godunov’s  flux  by  its 
generalization. 

fiy{x]+yz,Sr))  = fE(y(Xj+v2  - °.Jr).  v(*/+l/2  + °.Jr))  • (3’6b) 

Thus  we  approximate  (3.3)  by  a sum  of  piecewise  constant  Godunov  methods,  or  approximate 
Godunov  methods,  evaluated  at  several  time  layers.  The  quadrature  rule,  and  the  value  of  R , deter- 
mine the  order  of  time  accuracy  of  this  method. 

We  note  that  this  approximation  need  not  preserve  the  essential  non-osdllatory  property. 
Nevertheless,  due  to  the  (nonlinear)  nature  of  our  ENO  interpolant,  the  method  works  well  numeri- 
cally, as  is  seen  from  the  results  in  [24]  and  [25]. 

Next  we  consider  hyperbolic  systems  of  conservation  laws  (1.1).  In  the  linear  case,  /(«)  = Au, 
where  A is  a constant  matrix  with  a complete  set  of  right  and  left  eigenvectors  r(v\  l(v\  correspond- 
ing to  real  eigenvalues  \(v),  for  v = l,...,m.  We  proceed  formally  as  in  (2.1),  (2.2),  (2.3),  and  it 
just  becomes  a matter  of  computing  the  vector  valued  function  L(x;vn)  — v(x,0)  in  (2.3). 
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We  decompose  an  arbitrary  m vector  w as 


m m 

w = V (p)  • w)r<y)  = 2 wW 

v-i  V- 1 

using  the  usual  /2  inner  product.  These  are  used  to  construct  L(x;vn)  again  via  the  RP  or  RD 
reconstruction  approaches. 

The  RP  approach  proceeds  by  computing 

”',V)CW  = 2 4v)h  • 

£-0 


Then  we  proceed,  as  in  the  scalar  case,  to  compute  each  of  C'v(*»wW),  and  finally  by  letting 


L(x;vn)  = 


m 


2 


4-  Q‘\x]w(v)'n) 

ax 


(3.7) 


The  RD  approach  begins  by  computing  which  is  a piecewise  polynomial  interpolant  of 

order  N that  interpolates  v(v)  at  each  Xj.  The  rest  of  the  reconstruction  procedure  is  done  as  in  the 
scalar  case,  and  finally  we  replace  (2.25)  by 


I,v-1(r;v)  = S L'v-1(*;v^v))r(v) . 

v-l 


For  nonlinear  systems  we  denote  by  Ay  = df/du  (vy),  the  Jacobian  matrix  evaluated  at  vy,  and 
define  \jv),  ljv\  and  rjv)  in  the  usual  fashion.  This  time  we  decompose 


v 


m 


= 2 


(5 


v)'i 


= 2 


„Wo 


(3.8) 


For  each  v and  each  y0,  we  shall  construct  an  ENO  scalar  interpolant  such  that,  in  the  cell 
Xj  s x < Xj+V  QVJa+°(x,v<v'>)  is  the  unique  Nth  degree  polynomial  that  interpolates  v(l  ^(xy)  for 
j = 70,  j0  + 1 and  W — 1 neighboring  points  as  defined  in  section  IV,  and  °(x,v^))  which 
interpolates  v(vJ^(xj)  for  j = j0,  jQ  - 1 and  the  appropriate  N - 1 neighboring  points. 
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We  then  construct  the  m-vector  valued  ENO  piecewise  polynomial  of  degree 


N - 1 as  follows: 


1 


/•v  ^xjv)  = vj  + 2 X -7 -r(x-  XjYm 

H.-1  v-1  P1 


g-Vj#+°C^  + 0; 


V(v)) 


(3.9) 


for  Xj_y2  =s  x < xJ+v2. 

We  may  then  deconvolve  precisely  as  in  the  scalar  case  and  arrive  at  a vector-valued  version  of 
(2.25).  Moreover  Lemma  (2.1)  is  still  valid. 

The  RP  approach  is  done  analogously. 

Thus  using  either  RP  or  RD  we  have  enough  information  to  compute  the  vector  valued  analogue 
of  (3.2)  - the  semi-discrete  algorithm.  This  time  the  canonical  method  is  again  Godunov’s  which  uses 
the  exact  solution  to  the  Riemann  problem.  Other,  simpler  approximate  Riemann  solvers  may  be 
used  - e.g.,  Osher’s  [13],  van  Leer’s  for  the  Euler  equations  of  compressible  gas  dynamics  [21],  or 
Roe’s  [15]  with  an  entropy  fix  as  suggested  in  [16],  [17]. 

The  explicit  vector-valued  construction  follows  the  procedure  of  (3.5),  (3.6),  again  using  perhaps 
one  of  the  approximate  Riemann  solvers  to  replace  Godunov’s  method. 

Various  simplifications  of  these  procedures  are  possible  and  will  be  discussed  in  future  papers. 

Next  we  discuss  the  influence  of  boundaries  on  our  procedure. 

We  illustrate  the  idea  by  considering  the  linear  equation  (2.11),  with  a 0,  to  be  solved  for 
t,x  > 0,  with  initial  data  of  compact  support.  If  a > 0,  then  a physical  boundary  condition 
n(0,r)  = g(t)  must  be  given.  If  a < 0,  then  no  physical  boundary  conditions  are  needed. 

The  modifications  needed  are  two-fold: 


(1)  At  points  sufficiently  near  the  boundary  our  ENO  interpolant  will  lack  a choice  of  least 
oscillatory  direction.  We  will  choose  only  among  interpolation  points  which  lie  inside  the  region.  This 
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procedure  has  not  led  to  stability  problems  in  our  numerical  experiments.  This  can  be  explained  by 
the  adaptive  nature  of  the  stencil  in  the  interior.  However,  in  situations  where  discontinuities  flow 
into  or  out  of  boundaries,  oscillations  may  develop.  These  oscillations  do  not  seem  to  pollute  the  solu- 
tion globally  according  to  our  (now  rather  extensive)  numerical  experimentation.  We  regard  this  as 
essentially  the  same  problem  that  we  have  when  discontinuities  intersect  in  the  interior.  We  shall  dis- 
cuss these  matters  in  future  papers.  Some  relevant  numerical  experiments  are  presented  in  [24]. 

(2)  Instead  of  an  initial-value  problem,  at  x = 0 we  solve  an  initial-boundary  value  problem. 
This  is  easy  in  the  scalar  case  - if  a > 0,  we  just  use  the  given  boundary  condition,  and  if  a < 0,  we 
need  no  boundary  condition  since  the  wave  propagates  to  the  left. 

For  general  systems  of  equations  we  follow  the  same  procedure,  i.e.,  interpolating  in  the  interior 
directions  when  forced  to,  and  solving  an  initial-boundary  Riemann  problem  - perhaps  approximately. 
See  [10]  for  more  details  about  the  latter. 

One  variant  of  the  scheme  which  we  do  not  recommend  involves  interpolation  of  the  fluxes  to 
obtain  a high  order  method.  This  was  done  in  [11]  in  a TVD  context,  and  schemes  of  arbitrarily  high 
order  away  from  critical  points  of  the  function  /(«)  in  (1.1)  were  obtained.  One  might  think  that 
our  ENO  interpolant  might  be  used  on  the  fluxes  using  the  decomposition  of  an  £ scheme  into  its 
"upwind"  and  "downwind"  parts 

dfj+L'2  ~ ff+V2~  f(vj) 

dfj+V2  = f(vJ+\)  ~ ff+V2 

as  in  [11].  The  difficulty  here  occurs  because  of  the  lack  of  smoothness  of  f6  which  generally  occurs 
at  sonic  points.  This  degrades  the  accuracy  to  be  at  most  third  order  in  Ll  at  sonic  points,  if,  e.g., 
the  Engquist  - Osher  flux  is  used,  and  second  order  for  Godunov’s  or  Roe’s  methods. 


IV.  Essentially  Non-Osclllatory  Interpolation  and  Some  Examples 
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Consider  a scalar  mesh  function  {vy}y*__w. 

We  let  Q(x\v)  be  an  interpolant: 

Q(xj>v')  vj  j •••” 

Xj  = jh,  h > 0 . 

We  shall  study  a special  piecewise  polynomial  interpolant  of  degree  N,  £>v(*;v), 
sively  as  follows: 

Definition  (4.1) 

Q\x\v)  = Vj  + (x-  xj)  ^ , XjSx<  Xy+1 


= v[xj]  + [x-  Xj]  v[Xj,  XJ+[]  , 


where  v[xy  ,...,*y+v]  denotes  the  usual  coefficient  in  the  Newton  interpolant. 
We  also  define: 

*&  - y.  - j + 1 . 


Suppose  we  have  defined  Qs  ^jc.v)  for  xj  ^ x < xJ+l,  and  that  we  also  have 
*gfn  J).  Then  we  compute 

min  mu  1 

nun  A ^mtx 

and  proceed  inductively. 

If  |o-v|  s |frv|,  then 


<2v(^;v)  = £>v-1(*;v)  + n (*  - xK) 

jc-jrffl"1) 

**■ 


(4.1) 


defined  recur- 


(4.2a) 


(4.2b) 


(4.2c) 


(4.2d) 
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with 


£(-Y)  = ArC-Y"1)  - 1 

"■tan  ■"•nan  i 


(4.2e) 


Or  if  |<rv|  < |Zrv],  then 


Qn(x\v)  = Qw~\x\v)  + o’  II  (x  - xK)  (4.2f) 


with 


KW  - K{S~^  + 1 

"•max  "max  T 1 • 

Thus,  in  each  cell  x}  s»  x < jcy+1,  we  have  constructed  a polynomial  of  degree  N which  inter- 
polates v(x)  at  N + 1 consecutive  points  which  include  x}  and  xJ4.v  It  is  designed  so  that  all  its 
derivatives  are  as  small  in  absolute  value  as  is  possible,  given  the  above  constraints. 


Remark  f4.ll 

This  interpolant  can  introduce  small  oscillations  of  order  Jrv+1  even  for  monotone  and  smooth 
data  v(x). 

We  use  the  following: 


Example  ^4.1^ 

Let 

v(*)  = X3 

(4.3a) 

N « 2 

(4.3b) 

Xj  = (j~  1/2)A  • 

(4.3c) 

The  interpolant  Q\x%,v)  for  x between  xx  and  x2  will  interpolate  v(x)  at  xu  x2,  x3. 
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We  rescale,  letting 


*'  = f + i «•«*> 

vW  = + i . (4.4b) 

We  get 

<2V;v')  = —5x'  + 6(x’)2  (4.4c) 

so  a new  extrema  occurs  at  x'  = 5/12  i.e.  at  x = -A/12.  The  magnitude  is  0(h3)  in  the  unsealed 
variables. 

Our  next  result  shows  that  this  is  the  worst  possible  case  for  N = 2. 

In  fact  for  piecewise  smooth  data  and  A sufficiently  small  the  largest  possible  spurious  oscilla- 
tions for  Qs  will  be  0(/rv+1). 


THEOREM  (4.1) 

For  any  piecewise  smooth  v(x),  possibly  having  jump  discontinuities,  there  exist  an  A0  > 0 
and  a function  z(x),  such  that,  for  all  As  A0 

j 2v(x;v)  = z(x)  + 0(A-V+1)  (4.5a) 

where 


TV(z)  s TV(v) 


and  we  repeat: 


Qw(xpv)  = v(xj),  j = 0,  ±1,  ±2,...  . 


PROOF 

Consider  the  interval  x,  s x < Xj+1  and  study  two  cases; 


(4.5b) 


(4.5c) 
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(i)  v is  smooth  in  [xj,  Xy+1] 

(ii)  v has  a jump  discontinuity  in  [xJy  Xy+1]. 


Case  (i):  If  v is  smooth  over  the  full  interval  of  interpolation  fo.™,  x.^],  standard  interpolation 

results  imply  Qw  = v + OQ rv+1),  so  we  then  take  z(x)  - v(x).  Otherwise,  for  hQ  small  enough, 
there  exists  an  interval  containing  N + 1 consecutive  parts  such  that  all  divided  differences  w[  , , ] 
involving  points  in  this  interval  are  bounded  independently  of  h.  We  call  the  point  at  the  extreme 
right  x^y  If  [x^,  xJfW+1]  contains  a discontinuity  in  x,  then 


vI*jc.-^w+1] 


v(*jKW+i)  ~ v(yo) 

m\hm 


+ 0(h~m+l)  , 


(4.6a) 


where 


m = KW-K  + 1 . 


(4.6b) 


Ihis  follows  from  the  explicit  form  of  v[  ]. 

Hence  the  definition  of  Q^(x;v)  guarantees  that,  for  h small  enough,  there  will  be  no  discon* 
tinuity  of  v(x)  in  the  interval  of  interpolation  [x^,  x^ggi].  The  result  above  is  still  valid: 

Qs  = v + 0(/rv+1). 


Case  (ii):  We  may  suppose  h0  is  small  enough  so  that  v(x)  has  only  one  discontinuity  in 
xrW  ]>  ^ is  in  [xj,  x,+1).  For  a given  interval  of  interpolation  we  may  decompose: 

min  mix 

v — w + H (4.7) 

where  w is  Iipschitz  continuous  and  H is  piecewise  constant  with  a single  jump  which  occurs  in 

[•*/>  xj+ 1)- 

We  have  in  [x},  xJ+l): 

Q”(x;v)  = Qw(x;w)  + Qw(x-,H) , (4.8a) 
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where: 


and 


Q^(x\w)  = 2 v[xv.,...rc  w]  II  (j:  — jcu)  , (4.8b) 

.._«-(M  um>+' 


s C (4.8c) 


(where  C always  denotes  any  universal  positive  constant). 
This  implies  that 


j-  Qw(x;w) 


(4.9) 


By  Rolle’s  Theorem  the  interpolant  Q^ix'Ji)  of  the  piecewise  constant  function  must  have  an 
extremum  in  every  interval  (x„,  x„+1)  for  v + j,  AfW  ^ r ^ AT'-JO.  This  makes  a total  of  N — 1 
extrema.  Since  the  interpolant  is  of  degree  N,  it  must  be  monotone  in  [xj,  Xy+1]. 


Unis,  for  h — 1,  we  have 


max 

xj****)+ i 


20  0 


(4.10a) 


For  general  h,  the  scaling  gives 


max  Qv(x-,H) 

',s'sVi  “* 


£ 

h ' 


Thus  (4. 8)-(4. 10)  imply  that  2“v(x;v)  is  monotone  in  [xj,xJ+1].  We  take 

z(x)  = (?v(x;v)  . 


On  the  interval  [x},  Xy^-J 

TV(gv(x;v))  = | Vy*f. ! - Vyl  s TV(v)  . 

The  theorem  is  proven. 


(4.10b) 


(4.11) 


(4.12) 
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We  also  have 


Remark-02) 

Let  v(x)  be  piecewise  polynomial  of  degree  ^ N.  Then  in  any  interval  [xy,  Xy+1]  in  which 
v(x)  is  not  discontinuous  the  interpolant  is  exact 

&v(*;0)  “ v(x) . 


We  now  compute  "second"  and  "third”  order  accurate  approximations  to  the  linear  problem. 

u,  = -ux  (4.13) 

Using  RP  for  N — 2,  we  have,  for  2 ^ x < */+i/2  > 

Q\x;w)  = wj_hr2  + (x  - xj.y^vj  + (4.14a) 


+ 


1 (*  ~ *y-i/2)(*  ~ *7+1/2) 

2 h 


m(A_y/,  A+vy) 


where 


m{x,y ) = x if  \x\  ^ |y| 


(4.14b) 


m(x,y ) = y if  |y|  > \x\ 
and 

ATVy  = =F(vyTl  - Vy)  . (4.14c) 

The  algorithm  becomes 

vj+ 1 = v f - XA-IyJ1  + p ~—  |m(A_Vyn,  A+vj1)]  . (4.15) 

This  is  a TVD  scheme  [6]  for  X < 1 which  is  second  order  accurate  with  a first  order  degen- 
eracy at  critical  points. 

For  N = 2 with  the  RD  approach,  a simple  calculation  gives  the  algorithm  for  |X|  < 1: 
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\ 

vn+\  — v?-XA_[v"+  ■—  — m[A_v"+-^-m(A_A4.Vy,  A.A.Vy1), 

k / 


A+vj1  - jm( A.Ao-Vy",  A_A+Vyn)]]  . 


(In  [7]  we  obtained  a similar  algorithm,  with  both  of  the  m replaced  by  m.  We  proved  that  the 
scheme  in  [7]  was  truly  non-osdllatory.) 

The  scheme  (4.6)  is  truly  second  order  accurate,  even  at  critical  points  and  converges  for 
|\|  < 1,  at  least  according  to  extensive  numerical  tests. 

Using  N = 3 in  the  RP  approach  gives  us  for  xy_V2  ^ x < xJ+h7: 

If  |A_Vy|  £ |A+Vy| , then:  (4. 


Q\X;W)  = Wj_v2  + (X  - Xy_L7)Vy  + 


1 (■*  ~ Xi-b'Jjx  ~ 3+1/2) 


+ -gij(x  - Xy_S7)(x  - xy-^(x  - Xy+y2)m(A_A_Vy,  A_A*Vy) 


If  |A_v.|  > |A+Vy|  , then 


Q3(x;w)  = Wj  — + (x  ~ xj+vdv]  + T 


1 (X  ~ X,_L^)(X  - Xy+y2) 


+ ~7  (*  “ xj-V2)(.x  - x)+\rXx  - A+A+Vy)  . 


Then  our  numerical  scheme  becomes  for  |X|  < 1: 

Vy  J’1  = Vy  — XA_[v"  + ^ ^ m(A_Vy,  A+Vy) 


-i-  (X  - 1)(X  - 2)m(A_A_Vy,  A_A+Vy"),  if  |A_Vy"|  =s  |A*v;| 
jr  (X  - 1)(X  + l)m(A_A+Vy,  A+A+vj1)  , if  |A_Vy"|  > |A+Vyn| 
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This  scheme  is  third  order  accurate  except  perhaps  at  points  where  ux  or  uxx  = 0,  at  which  it 
may  degenerate  to  second  order  accuracy. 

For  N = 3 using  RD  we  have  for  Xj  x < xJ+l: 

Q\x;v)  = vj  (4.20) 


(x—xi)  (x  — xf)(x—  x/+1)  _ 

+ — A +Vj  + L—L — m(A_A+vy,  A^A + vy) 


1 (*“*/- i)(*-*y)(*-*/+iMA_A_A+vy,A_A+A+vy), if  |A_A+vy|=s|A+A+vy| 

6A3  (jc-xy)(x-xyx1)(x-j:y*2)m(A-A+A+vy,A+A+A+vy),if|A_A+vy|>|A+A+vy|  ' 

We  can  derive  a globally  third  order  accurate  scheme  by  using  (2.20),  (2.24),  (2.25),  and  (2.27). 
We  omit  the  details  here. 

It  should  be  stressed  that  our  algorithms  are  to  be  obtained  recursively  using  the  computer.  We 
have  written  down  a few  numerical  fluxes  here  just  to  give  the  reader  some  idea  of  what  they  look 
like. 


V.  FURTHER  THEORETICAL  RESULTS  AND  EXAMPLES 

While  Theorem  (4.1)  is  encouraging  in  that  it  shows  us  that  the  interpolant  Qs  is  indeed  essen- 
tially nonoscillatory,  more  analysis  needs  to  be  done.  The  schemes  designed  in  section  II  do  not  use 
this  function  in  a simple  enough  fashion  for  us  to  prove  the  desired  estimate  (1.10),  even  if  vh(x,t)  is 
piecewise  smooth. 

As  a step  in  this  direction  we  consider  the  method  based  on  RP  applied  to  a piecewise  continuous 
function.  A canonical  example  involves  the  interpolant  £>v(*;g),  where  g(x)  is  the  primitive  of  a 
Heaviside  function  normalized  to  be: 

g(x)  = a - x,  x ^ a (5.1a) 
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g(x)  = 0,x>  a 


(5.1b) 


for  0 £ a < 1. 

We  let  h = 1 and  compute  the  least  oscillatory  piecewise  polynomial  Qw(x,g)  which  interpo- 
lates g(x)  at  x - j for  each  integer  j.  By  Remark  (4.2)  we  have 

Qw(x-,g)  =•  g(x)  for  xsO  and  x^l.  (5.2) 

We  need  only  compute  Qv(x;g)  for  0 < x < 1.  We  wish  the  reconstructed  function 
d/dx  Q?(x,g)  to  be  a non-osdllatory  approximation  to  g(x).  This  reduces  to  showing  that  on 
Osxsl 

-1  Q\x\g)  0 (5.3a) 

Qw(x;g)  s=  0 . (5.3b) 

The  least  oscillatory  polynomial  on  the  interval  0 ^ x ^ 1 will  be  one  of  the  N + 1 polynomi- 
als of  degree  N,  Qx(x;g),  which  interpolates  g(x)  at  the  N + 1 consecutive  points 

{K  - K - N + 1 0, 

for  1 s K =s  N. 

■ In  the  proof  of  Theorem  (4.1)  we  showed  that  any  polynomial  which  interpolates  the  derivature 
of  this  function  g'(x)  through  these  N + 1 points  is  monotone  on  the  interval  Osxsl  In  con- 
trast we  have 

Example  ('5.1'! 

/-tV  _ / \ , (x  + N — l)(x  + N — 2)..JC  r-i  1 /e  A \ 

Q\  = (a  - X)  + J L [1  - a]  (5.4a) 

thus: 

- -1  + (1  - a)[l  + ...  + i]  > 0 . 
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for  N( a)  sufficiently  large,  when  a is  fixed:  1 > a > 0. 

Thus,  in  order  for  the  inequalities  (5.3)  to  be  valid,  we  need  the  special  properties  of  the  least 
oscillatory  interpolant  of  g(x).  We  have: 

Lemma  (5.1) 

The  least  oscillatory  polynomial  of  degree  N is  Ql  iff 
1 - K/N  £ a < 1 - (K  - 1 )/N,  K = 1,2,. ..ft. 

Finally  we  have: 

Lemma  f5.21 

The  polynomial  obtained  in  the  statement  of  Lemma  (5.1)  satisfies  the  inequalities  (5.3). 

We  shall  present  the  proofs  of  these  claims  in  the  Appendix. 

Next  we  consider  the  method  based.on  RP  applied  to  a smooth  perturbation  of  a Heaviside  func- 
tion g'(x).  We  find  here  two  new  problems. 

(1)  The  error  between  d/dx  Q^(x;g)  and  g'(x)  in  the  cell  next  to  the  interval  containing  the 
discontinuity  need  not  be  OQ rv)  - it  can  be  as  bad  as  0(h)  for  N > 1. 

(2)  The  variation  in  this  cell  can  increase  - i.e.,  Wai[d/dx  Qw(x;g)]  in  this  cell  can  exceed  that  of 
g'(x)  in  this  cell  by  0(h2)  for  N > 2. 

On  the  plus  side  we  note  that  these  are  somewhat  pathological  examples,  that  the  error  and 
growth  in  variation  are  indeed  decaying  with  h,  and  that  two  cells  away  from  the  discontinuity  all 
seems  well  in  that  the  error  and  possible  variation  growth  appear  to  be  OQr1).  Nevertheless  we 
expect  to  investigate  other  ENO  interpolation  procedures  as  well  as  alternative  reconstruction  tech- 
niques, with  an  aim  towards  removing  these  (hopefully  minor)  problems. 
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Example.  (5.2) 

Let 


g(x)  = + a(x  + gh),  x > _g/, 

g(x)  — -x  — Bh,  x ss  -Bh  for  1 > B > 0 . 

Then  the  function  we  are  approximating,  g'(x),  satisfies 
g'(x)  = — 1,  x ^ -Bh 

g'(x)  = x + Bh  + a x > -Bh  . 

We  shall  obtain  &v(x;g)  which  interpolates  g(x)  at  grid  points  x}  = jh,  j = 0,±  1.  We  are 
interested  in  Q?  for  0 s r s A.  We  shall  arrange  a and  B so  that 

-f  Qv(x;g)  for  N = 2 and  3 
ax 

both  have  0(h)  pointwise  error  compared  to  that  of  g'(x)  on  this  interval. 

We  do  this  as  follows: 

For  0 SrsA: 

QK*>8)  = 8(0)  + f(g(h)  - g(0))  . 

Next  we  arrange  a and  B so  that  the  three  consecutive  points  (—h,g(—h)),  (0,(g(0)),  and 
( h,g(h ))  are  collinear: 

g(h)  - 2g(0)  + g(-h ) = 0 
or 

g(-h)  = h2  - - ah(B  - 1)  = A(1  - B) 

or 


(5.4a) 

(5.4b) 

(5.5a) 

(5.5b) 


(5.6a) 

(5.6b) 
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(5.6c) 


•|(fl  - l)2  + (a  + 1)(B  - 1)  - h = 0 . 


We  solve  this  obtaining 

B(h,a)  = 1 + — ~ + 0(A2)  , (5.6d) 

a + 1 

and  since  we  want  Osfl  s 1,  we  take  a < —1. 

Thus  we  have  for  0 ^ x s h 


Q\x\g)  = Q\x,g)  , 


which  interpolates  g(x)  at  x = -A,0,A  and 


which  clearly  differs  from  g'(x)  by  0(A)  at  some  points  in  this  interval. 

We  also  claim  that  d/dx  Q3(x,g)  - g'(x)  is  0(A)  in  this  interval  as  well.  It  is  easy  to  see  that 
Q3  will  be  chosen  to  interpolate  g(x)  at.jr  = — A,0,A,  and  2A.  Thus  in  our  interval  of  interest: 


C’Crw)  = (?(.»)  + fe  - + *> 

6 


^ + aA(5  - 1)  + A (5  - 1) 


Thus 


T7  C3(^;s)  = 77(8  - 1) 


dr2 


A2 


and  then 


.s.  £ ^ - « 


~ ^ +(!+*) 


= f + 0(J2) 


and  the  error  is  still  0(A)  since  var0StSA  g’(x)  = A. 

Our  next  example  well  allow  for  an  increase  in  variation  in  this  cell,  although  it  will  still  decay 
with  A as  A 1 0.  Let: 
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*(*)  = + b(x  + Bh),  X > -Bh  (5.7a) 

g(x)  = -x  - Bh,x  * -Bh  (5.7b) 

for  1 > B > 0 . 

Then  the  function  we  approximate  is: 

*'(*)  - -1,*  ^ -Bh  (5.8a) 

8' (pc)  = + + b . (5.8b) 


This  time  we  want  the  points  (-h,g(-h)),  (0,5(0)),  (A,g(/»))»  and  (2 h,  g(2h))  to  all  lie  on 
the  same  parabola.  This  means  that 

O = h3  + h3  ^ + (1  + b)h(B  - 1)  (5.9a) 

o 

or 

s - 1 - 7TT  (5-9b) 

Thus  we  take  b > -1. 

On  the  usual  interval  0 sjsA  we  have 

Ql(x;g)  = ^(0)  + ~ ( g(h ) - 5(0))  (5.10a) 

Q2(x;g ) = Qx(x;g)  + h)  \s(h)  - 2g(o ) + g(-h)]  (5.10b) 

and  by  (5.9): 

Q\x\g)  - Q2(x;g)  (5.10c) 

The  function  g'(x)  is  monotone  on  the  interval  Osxs/i  as  long  as  b is  not  0(h2)  so: 
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while 


var 

0=St^A 


var  -f  OW)  = - W -1-  ^°)l  - hK  1 + J) 

Os.tsA  ax  n 


Thus  an  oscillation  of  order  h~/ 2 is  induced  in  what  should  be  a third  order  method. 

We  note  that  the  discontinuity  in  g'(x)  is  rigged  so  that  it  occurs  at  a distance  0(h2)  from  a 
grid  point.  This  is  a bit  pathological,  but  is  certainly  possible. 

This  oscillation  is  maintained  even  when  we  increase  the  order.  For  example,  in  the  same  inter- 
val it  can  be  easily  shown: 

<«**)  = QHx;s)  + ± *>  [*(-*)  - |(-*)] 


where  g(x ) is  the  continuation  of  the  cubic  polynomial  g(x)  to  x negative. 
Thus 


«(“*)  - g(-h ) = A(1  - B)  - 


4-(B  - l)3  + bh(B  - 1) 
o 


« h(  1 + h)(l  - B)  + (1  - 5)3  = h2  + 0(h9>i 

o 


A simple  calculation  gives  us 


var 

Q^x^h 


QVa)  = ft2 


+ Of*!)  , 


and  we  again  have  a variation  increase  0(h2)  in  this  interval. 

Next  we  show  that  a scheme,  which  is  non-osdllatory  for  relevant  data  in  that  new  extrema  do 
not  develop  on  the  initial  data  as  h increases,  can  still  be  extremely  unstable. 


Example  5.4 
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We  approximate 


u,  = -ux  (5.11) 

by: 

v""1  = v;  - XA_m(v"+1,v")  . (5.12) 

We  take  as  initial  data 

vf-Ojs-l  (5.13a) 

vg  = a (5.13b) 

v?  = e - a (5.13c) 

vf  - 0, ; a 2 (5.13d) 

for  0 < e « a,  0 < X < 1/2. 

An  explicit  computation  gives  us: 

vj  * vj  if  j — 1,  j a 2 (5.14a) 

Vjj  = a(  1 + X)  — Xc  (5.14b) 

vj  = (£  - a)(l  + X)  . (5.14c) 

Thus  the  "shape"  of  the  initial  data  is  invariant  in  time  and 

vg  -® 


Now  we  analyze  the  truncation  error  TE  for  our  two  methods.  We  begin  with  RP  applied  to  the 
linear  equation  (2.11)  and  arrive  at  (2.13).  In  this  case  a precise  expression  for  TE  is: 


- Q%+v 2 - aj-,W)  + W(xJ+v2-ar)] 


We  recall 

W(x)  = / «(*)<&  with  (5.15b) 

W(x)+ui)  = Q?(Xj+ vi,w)  (5.15c) 

and 

(^•j  Q(*W)  - W(x)  = 0(/rv+1-v)  (5.15d) 

in  regions  where  lV(x)  is  sufficiently  smooth. 

It  is  dear  that  the  TE  is  £?(frv)  as  long  as  the  coeffident  multiplying  the  /rv+1_v  term  is  dif- 
ferentiable when  for  v = 1.  This  will  be  true  in  general  if  the  stencil  of  points  used  for  the  interpo- 
lant  in  two  consecutive  intervals  is  invariant  under  translations.  This  is  true  in  smooth  regions  if  none 
of  the  derivatives  of  u(x)  up  to  order  N — 1 vanish  in  a neighborhood  of  this  interval. 

We  thus  have 


Theorem  f5.1^ 

TE  for  the  explidt  and  semi-discrete  methods  based  on  RP  approximating  a linear  equation  is  of 

order 

TE  = 0(kw),  if  u(x ) * 0,  r = 1,2,. ...Af  - 1 (5.16a) 

TE  = 0(/rv_1)  otherwise  . (5.16b) 

For  the  full  nonlinear  problems  the  algorithm  (2.14)  can  easily  be  shown  to  satisfy  estimate 
(5.16b)  above. 

The  computational  evidence  is  that  (5.16a)  is  valid  under  conditions  stated  there  for  the  non- 
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linear  case.  We  believe  this  to  be  true,  but  do  not  prove  it  here. 


Next  we  state: 


Theorem  (5.2V 

TE  for  the  explicit  and  semi-discrete  methods  based  on  RP  for  general  nonlinear  equations  is  at 

least 


TE  = 0(hs~l)  . 


(5.17) 


Finally  we  analyze  TE  based  on  RD.  Recall  we  are  given  via  interpolation  the  values: 
a, v = hv(d/dx)v  u(xj ) + 0(hw+l),  v = 0,1,. ...AT  - 1 


Next  we  compute 


using  the  matrix  equality: 


bv  = hv 


d_ 

dx 


u(xj ) + OQr1) 


ao 


a.v-i 


1 a,  • • 
0 1 ttl 
0 0 - • • 


a.v-i 
• av_2 


0 0 


0 1 


u(xj) 

u(xj) 


i.V-l 


u(.Xj) 


(5.18a) 
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+ o(ks *1) 


where 


1 + (-1Y 


2 V(v  + 1)! 


(5.18b) 


Call  C the  upper  triangular  Toeplitz  matrix  on  the  right  above.  We  approximately  invert  the 


system,  obtaining 


u(xj) 

\ 

d < \ 


• =c-H  • = 


• yv-i 

h~fc\ 


+ C'1 


+ 0(/rv+1) 


Next  we  compute  the  function  Lr~\x\u)  as  in  (2.25),  for  Xj_v2  s x < Xj+V2 


.v-i  (x  _ x y 
^ t ZZ tL L 


Lv~1(x;u')  = % bv  - 
r-o  A v! 
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“V 

a,V-l 


kj- 

dx 


,V 


u(xj) 


1, 


X - X, 


- r.YV-1 


(*  - 


frV-l 


'-1 


«1 


+ 0(/rv+1) 


To  show  that  TE  = 0(/rv)  in  this  case,  we  need  only  prove  that 

dW^Cxjh)  - «(x)]  o o(frv)  (5.19) 

for  u smooth.  This  follows  by  the  smoothness  in  both  Xj  and  x up  to  0(/rv+1)  of  the  remaining 
terms  on  the  right  side  of  (5.18). 

Thus  we  have: 

Theorem  5.2 

TE  for  the  explicit  and  semi*discrete  methods  based  on  RD  approximation  for  general  systems  of 
equations  is  0(hw). 

We  also  note: 

Remark  (5.1) 

We  have  been  unable,  so  far,  to  prove  that  these  methods  are  indeed  essentially  non-osdllatory 
although  our  present  results  show  that  the  interpolation  upon  which  the  whole  framework  is  based 
does  indeed  have  this  property. 

Remark  (S.2) 

If  u0(x)  has  two  neighboring  discontinuities  and  h is  not  sufficiently  small,  our  present 
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methods,  for  N > 2,  can  result  in  nontrivial  spurious  overshoots.  We  shall  remedy  this  difficulty  in 
subsequent  papers. 


Appendix 

We  shall  provide  the  (lengthy)  details  of  the  proofs  of  Lemmas  (5.1)  and  (5.2). 


Proof  of  Lemma  f5.ll 

We  shall  use  induction  on  N.  The  result  is  trivially  true  for  N = 1.  Suppose  it  is  true  up  to 
N.  We  consider  the  interval 


1 - 


N + 1 


a < 1 


-f 


K - 1 
N + 1 


We  divide 

this  into  two  parts 

1 - 

f \ 

K - 1 

sa<l- 

U-i] 

N 

\ 

[Af+lJ 

h' 

fi- 

\ 

K 

sa<l- 

( K-l ) 

l1 

N + 1 j 

l ^ J 

< 1 - 


f AT  - 2 1 


(Al.a) 


(Al.b) 


after  verifying 


1 


/ 

K - 1 

l*+1J 


< 1 - 


K-KN 


and 


1 


K 

N + 1 


< 1 - 


K - 1 K 

N N + 1 
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N/K  - N + K - 1 < 

A 


K — 1 < N 


Thus  by  the  induction  hypothesis:  for  a € IR,  Q^(x\g)  = Q%-i (x;g),  (if  K = l,  1R  is  empty), 
and  for  a € lL,  Qv(x;g)  = Q%(x;g). 

We  wish  to  show  that  for  a € IR  U IL  that  Q?+l(x;g)  = Q^+l{x',g).  Using  the  iterative  defini- 
tion of  2$+1(;t;g),  we  must  compare  the  two  Newton  coefficients 


-R  = {a-K) 


(a — (JC — 1))  -t h (—  1)*“ 1 


'N+ 1) 

M 


(a-1) 


(A2.a) 


-S  = (ct-(*-l)) 


(a-(^-2))  + 


(A2.b) 


+ 


rN+ 1' 

K- 1 

\ 


(a— 1)  • 


We  wish  to  show 


\R\*\s\ 


(A3) 


for  these  values  of  a. 

To  prove  this  we  need  the  following: 


Fact  (Al) 


M 

f \ 

n 

''o 

i 

+ (-1)* 


V 

k 

V s 


(-1)1 


for  0 £ K ^ n - 1 


Fact  (A2) 
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4)  -c*-i) 


i 


+ ...  + (-i)^ 


-i 


n 

K - 1 


= (-1)^-1 


f*  - 2' 
K - 1 


for  lsjf  sb-1. 


Proof  of  Fact  A1 

Again  we  do  it  by  induction.  It  is  true  for  K = 0.  Suppose  it  is  true  for  AT.  Add 
% + jj  to  both  sides  of  the  equality.  On  the  right  we  have 


(- 1)*+ 1 f 

V ; ((/i  - AT  - 1)!(AT  + 


[n  - 1)1 


1)!  (n-K-l)\K\) 


= c_nir+if (*  ~ 1)1 ) n (AT  + 1)  ) 

J [(«  — AT  — 2)I(Af  + 1)1  J (n  — AT  — 1 (n  - AT  - 1)  J 


(n  - l) 

a:  + 1 


Proof  of  Fact  A2 

Using  induction.  We  see  that  it  is  true  for  K = 1.  Suppose  it  is  true  for  K.  Then 


(by  the  induction  hypothesis) 


422 


= (-1)* 


in  ~ 1)1 

K\(n  - 1 - AT)! 


iH  - 2)! 

(K  - !)!(«  - 1 - AO!  J 


f-iyc  in  - 2)1  n - 1 £_ 

V ' K\(n  -2-K)\  {(n-l-K)  (rt  - K - K) 


- 2' 

K • 


Using  these  facts,  we  have: 


* = (1  - a)(-l)*-i 


N 'l  _ , 

n — r 

K - 1 +(-1)X"2 

V 

K-2 

< 

(A4.a) 


5 = (1  - a)(— 1) 


-1^-2 


a:  - 2 + AT  - 3 

/ \ / 


(A4.b) 


We  note 


(Ar-l'j  ( N \ 

- c-2  -0— )r-i 

/ V / 


w - i 

a:  - 2 


N 

K - 1 


a:  - 1 


Af 


-! 


'at  - n 

K - 2 


(1  - 1)  = 0 


so 


Also 
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K - 1 
N + 1 


N 

K - 2 


(N  ~ 1 
AT  - 3 


We  check 


Tv  - f 

f W - l)  ) _ 

Tv  - r 

N - K + 2 

K - 3 

\ / 

{(N  + 1)(K-  2)  J 

- 3 

k / 

+ 1)(AT  - 2)  J 

|5|  » (1  - a) 


r n 

K -2 


(N  - 1 
K - 3 


isl  * W 


(l-a) 


N! 


N! 


(iiT  - 2)! (AT  - K + 2)1  {K  - 1)!(N  - K + 1)! 


(n  - m 


W - 1)1 


(at  - n 

N + 1 


(a:  - 2)l(AT  - K + 1)1  (K  - 3 )!(AT  - K + 2)1 


[//(AT  - 1)  + N(N  - K + 2)] 


(K  - 1)(N  - K + 2)  + (K  - 1 )(K  - 2) 


N^N 

For  the  two  Newton  coefficients  are:  the  same  R and 


5 = (1  - «)(-!)*  * + (-l)*”1 


K - 1 


This  time  we  have 


= (1  - a) 


Tv  - 1' 

_ js:  - 1 

N 1 

Tv  - 1' 

F-iJ" 

K — 2 

\ J 

^ — 

N 

k-ij- 

K — 2 

\ / 

and 
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pv-  r 

- (1  - a) 

w 

> 

- r 

/ \ 
K 

'N' 

[K~h 

K 

k J 

- 1 

k > 

N + 1 

\ ✓ 

K 
\ / 

rN  - 1' 

i N 

a:  - i 

/ 

N + 1 

/ 

> 0 . 


Finally  we  check 


1*1  * 1*1 


or 


(N  ~ 1) 

2v-  l' 

r ' 

4 

+ 
v— * 
1 

a:  - 2 

k J 

a (1  - a) 

K ~ 1 

V J 

+ 

a: 

k / . 

Of  - 

(K  - 1 )l(N  - K) 


1 + 


K - 1 
N - K + 1 


a ^ QL  ~ 

N + 1 (K  — 1)1  (AT  - K)\ 


i + i 

(i  — a:  -i- 1)  a: 


N KN 

N — K + 1 N+l 


N + 1 

K(N  - K + 1) 


Thus  Lemma  (5.1)  is  proven. 


Proof  of  Lemma  (5,2) 

We  start  with  a general  geometric  result. 


Fact  (A3) 

Given 


£ G2(l)  * 0 

(A5.a) 

j;  QK 0)  * -1 

(A5.b) 
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Then 


(A5.c) 


£ $(  1)  ^ ± Q%{ 0)  . 


Q%{x)  a 0 for  0sxsl 

dr1 


(A6) 


Proof  of  Fact  A3 

RoUe’s  Theorem  tells  us  that  d/dx  Q^(x,g)  = 0 at  least  once  in  each  interval 
(1,2),..., (K  - 1 ,K)  and  d/dx  Q%(x,g ) = -1  at  least  once  in  each  of 

(K  — N , K — N + 1),. ..,( — 2, — 1),  (—1,0).  If  K = 1,  this  means  that  d^/dxr  Q^(x,g)  = 0 at  least 
N - 2 times  for  x < 0.  Thus  d/dx  Q% (x,g)  is  monotone  for  0 :£  x < 1.  If  K = N,  then  a simi- 
lar argument  shows  that  fi/dx1  £$(*;$)  = 0 at  least  N -2  times  for  x > 1 and  the  same  monoti- 
dty  result  follows.  Given  (A5(c)),  this  takes  care  of  these  two  cases. 

For  1 < K < N we  proceed  as  follows.  If  d/dx  Q%=  0 at  least  once  in  addition  to  these 
values  mentioned  above  for  1 £ x s K,  then  it  equals  0 at  least  K times  for  x a 1 and  -1  at 
least  N - K time  for  x ^ 0.  By  our  usual  argument  this  means  that  it  is  monotone  on  (0,1),  and 
we  are  finished.  Similarly  if  d/dx  Q%(x)  - -1  at  an  additional  point  for  K — N s x s 0,  the  same 
conclusion  follows. 

If  both  of  these  possibilities  are  false,  then  the  graph  of  Qg(x;g)  looks  like  for  Isx: 

(a)  Kodd 


Fig.  Al:  q|J  for  x > 1 
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and  for  K - N s x ^ 0,  the  graph  looks  like: 


(a)  N - K odd 


(b)  N — K even 


Fig.  A2:  Q%  for  x s 0 


If  the  leading  coefficient  of  Q%  vanishes,  then  we  have  a polynomial  of  degree  jV  — 1,  and  its 
derivative  is  monotone  on  (0,1)  as  per  our  usual  argument. 

Otherwise  we  consider  the  following  cases. 


Case  fll  K and  N — K even.  Then  if  the  leading  coefficient  is  positive  it  follows  from  glancing  at 
Fig  (A2.b)  that  d/dx  Q%  = — 1 for  some  x < K — N and  we  are  finished.  If  the  coefficient  is  nega- 
tive then  Fig.  (Al.b)  shows  us  that  d/dx  Q%  = 0 for  some  x > K and  we  are  again  finished. 
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Case  (2\  K and  N - K odd.  Then  if  the  coefficient  is  positive  Fig  (Al.a)  shows  d/dx  Q$  = 0 for 
some  x > K.  If  the  coefficient  is  negative  then  Tig  (A2.a)  shows  d/dx  Q%  = - 1 for  some 
x<K  -N. 

Case  (3)  K odd,  N - AT  even.  If  the  coefficient  is  positive  then  Fig  (Al.a)  shows  d/dx  Q%  vanishes 
for  some  x > K. . If  the  coefficient  is  negative  then  Fig.  (A2.b)  gives  us  the  desired  result. 

Case  (4)  K even,  N - K odd.  If  the  coefficient  is  positive  then  Fig.  (Al.a)  gives  us  the  desired 
result.  If  the  coefficient  is  negative  then  Fig  (Alb)  does  it. 

To  prove  Lemma  (5.2)  we  need  only  verify  the  inequalities  (A5).  We  finally  write  down  the 
formula  for  0%: 


Lemma  (All 


+ 


(-i  r2 


'N  - K + j - 
N - K 

s 


where  we  define 


u 


= 0 if  either  B < 0 or  B > A. 


Proof: 

Clearly  Q%  = a — x for  x-K-N,K  — N + 1,...,0. 
For  x = l,2,...,Jk,  we  need 


(A.7) 
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a — x + 2 

y-i 


(N  - K + x 
N-K  + j) 


'N-K  + j-2) 

j- 2 J 


a - ax-iy-1 


= 0 


{N  - K + j - ll 

y-i 


+ (-i  y-2 

This  will  follow  if,  for  all  integers  M & 0: 


(A.8) 


Fact  fA41 


("  + ;](  iy~‘ 


fM+J-ll 

M 


, v a 1 


Fact  (A.5); 


v-iK:;;j)(-iy-f  "j-11 


Proof  of  Facts  (A41  and  fA.51 

We  shall  again  use  induction:  For  M = 0 we  need 


v 1 


0 = - 


l(-iri0- 


s (-i  y 

y-o 


V 

£ 


This  follows  from  Fact  (Al)  for  v = n - K + 1. 
We  also  need 


v-  2 

y- 1 


v fv  + l) 

5 1,  • 


■it:;)™-- (■:’)*(”■) 


= V 


by  (A.  10). 

Suppose  both  Facts  are  true  up  to  M.  For  M + 1 we  need 


(A  9a) 


(A  9b) 


(A  10) 


(All) 
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i=  2 


/- 1 


(Af  + 1 + v'l 
A/  + 1 + j 


y \M  ± 1 + v" 

y-i  U + 1+;; 


" + i + y l " J 


- y — i — 
yfi  Af+1 


M + 1 + vlf-iy-iPl/  + J”  *1 

Af  + 1 + y'J''  V [ A/  J 


(A  12) 


_ Af  + 1 + v 
N + 1 


Af  + 1 


(by  the  induction  hypothesis) 


Now  we  show  (A. 9b)  is  true  for  Af  + 1 by  induction  on  v.  It  is  clearly  true  for  v = 1.  If  it 
is  true  for  v — 1,  we  consider 


1=2  fM  + vl 


(_iy-i  (M  + J~ 


(M  + j-l) 

Af  J 


j;  (Af  + v + l' 

y?i  + 


c-iy-fT1) 


+ (-«rf +;-xl 


+ 2 

y-i 


( Af  + v 
Af  + y + 1 


Af  + y - 
Af 


n<- 


iy 


•(v  - i)  + 2 

y-i 


or 


v 


V 


2 

y-i 


Af  + v + 1' 
Af  + y + 1 

/ 


(-iy-i 


M + y 
Af 


‘) 


Now  we  verify  (A5.a) 


j-  eSK°)  = -i  + 1 (-i y- > 

y-i 


(N  ~ AT11 
(N-AT  + y)! 
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a-oc-iy-f  7-V'1 


+ (-iy-2 


'N  - K + j -2) 
j-  2 


s -1 


or 


(1  - a)  i { I ] - 4 

^ j?i(N-Z+j)  y-i  (fl-K  + j)(N  - 


K+j-  1) 


or 


-a  2 i + (V-A0  2 


y-.v-ff+i  J 


j-Pt-R+l  j(j  ~ 1) 


We  use  the  identity:  for  A < B 

3 1 3 

y-A  JU  ~ 1)  ~ y?4 


u - 1 


= 1 _ J_ 

A -1  B 


So  (A.  13)  becomes 


-a  x 7 + (*- WttSf-v 
y-.v-r+i  J “ & W 


#*(  S y 

N (J-N-K+ 1 


. HlJbbH) 


.V 


1 


o a x 4(v  - * + l)  - k . 

J-N-K+ 1 J 


If  we  replace  the  right  side  above  by  K + 1,  we  get 


.v 


X -r(v  - £)  - K - 1 
y-.v-tf  J 


= X -t(w  - * + 1)  - at  - x 4 

y-.v-*+i  ./  y-.v-Jt+i  1 


1 


(A.  13) 


(A  14) 
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Thus  the  right  side  above  is  decreasing  with  K,  and  we  need  only  verify  the  inequality  for 


K = 1 


0 


M 


-1  = 0 


Next  we  compute: 


= _ i + 


( S-K+ 1 . ' 

2 1 

v-l  V 


(1-a) 


* 

+ 2 


y"2 


w - a:  + iw>  - 2if-iy~2 

(N-K  + j)  1 


(1  - «)(-iy-i " 


-K  + j 

j-  1 


+ c-iy-2 


(N  - AT  + j - 2^ 

j -2 


Rearranging  terms  and  simplifying  gives  us: 
(-a) 


(n  - at+  r 

1 

(N-AT  + yJ 

J~  1 

+ 2 (N-K+l) 


j-2  (N-K  + j-l)(N-K  + j ) 


1 . 


. Now 


v N - AT  + 1 _ y 1 _ y 1 

A 0-l)(W-K  + j)  yt'aJ-l  y-2  N-K  + j 

The  first  term  in  (A.  15)  this  becomes 


.v 


1 


(1  - a)  2 7 


Using  the  identity  (A  14),  the  second  term  becomes: 


(tf-AT+1) 


N - K + 1 N 


= 1 - 1 + 


k - i _ a:  - l 


(A  15) 
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So  we  have  to  check: 


(1'a)ii+^1 


|yi  + K ~ 1 

N A j ff 


s 1 , 


Again,  if  we  replace  AT  by  AT  + 1 on  the  left  side  above,  it  increases  by  1/JV  2*  +1 1//.  Thus 
we  need  only  verify: 


_1_  + ^ ~ l < i 
N N N 


lsl. 


The  last  step  is  to  verify  that: 

£ ®°) s £ eaD 

or 


,v 

-«  2 

J-N-K+ 1 


f ss(l-«)Sj  + 

n J-k  J 


K - 1 

N 


or 


x 

AT 


.v  1 ,v 

(1  - a)  2 4 + Ot  2 

y-tf  i y-.v-jc+l 


j 


‘V-i  i r .v-t  , .v-i  . 

os27  + » 2 7-2-7 

y-JC  J (J-N-K+ 1 ^ y-ff  J 


.v-l  , .-y-1  , 

Os(l-a)2-r  + a 2 4 . 

)-k  J y-.v-jc+i  y 


.V—  1 


Thus  Lemma  (5.2)  is  proven. 
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ON  NUMERICAL  DISPERSION  BY  UPWIND  DIFFERENCING 


Bram  van  Leer 

l 

Delft  University  of  Technology 
Delft,  The  Netherlands 


ABSTRACT 

Upwind-biased  difference  schemes  for  the  linear  one-dimensional  convection 
equation  are  defined.  It  is  demonstrated  that  the  numerical  dispersion  caused 
by  such  schemes  changes  sign  in  the  middle  of  the  allowed  GFL-number  range. 
This  makes  it  possible  to  annihilate  dispersive  errors  in  two  steps. 
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1.  INTRODUCTION 


Upwind  differencing  is  a way  of  differencing  convection  terms.  For 
the  scalar  convection  equation 


+ au  =0, 
t x 


(i) 


discretized  on  a uniform  grid  {jAx,nAt},  the  best-known  upwind-differ- 
ence approximation  is  the  explicit  first-order  scheme  of  Courant, 
Isaacson  and  Rees  (CIR)  [1], 


n+ 1 n n n 

u.  - u.  u.  - u.  , 

_J 1 + a J J -1  = 

At  a Ax 


0,  a > 0, 


(2.1) 


n+ 1 n n n 

u.  -u.  u. , , - u. 

— 1 + a J Ax  3 = 0,  a < 0. 


At 


(2.2) 


Introducing  the  Courant-Friedrichs-Lewy  (CFL)  number 


„ At 

O - a "7  m 

Ax 


(3.1) 


we  may  rewrite  (2)  as 


n+1 
u . 

J 


= (1  - 


where 


s = sgn  a. 


(3.2) 


(3.3) 
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The  scheme  is  stable,  even  in  the  maximum  norm,  under  the  CFL  condition 


a < l. 


n+l  . 


The  value  of  u^  given  in  (3.2)  may  be  regarded  as  an  approximation, 
by  linear  interpolation,  to  the  value  of  the  exact  solution 


n+l  , * n \ 

u.  = u(x.  - a Ax,  t ) , 
j J 


which,  for  non-integer  cr,  gets  lost  in  the  process  of  discretization. 

The  interpolation  at  tn  involves  only  the  two  nodal  points  nearest  to 

x.  - oAx.  Thus,  the  numerical  domain  of  dependence  of  u?+^  is  upwind- 
J J 

biased. 


The  upwind  bias  becomes  more  obvious  as  larger  values  of  the  CFL 
number  are  allowed.  If  m is  an  integer  such  that 


m < a < m + 1 , 


(6.1) 


a stable  upwind  scheme  is  [2] 


u.  = (m  + 1 - (j)u1?  + (a  - m)u?  t . 

J j-m  j-m-1 


(6.2) 


Upwind  differencing  is  often  compared  to  central  differencing, 
where  the  numerical  domain  of  dependence  of  uj+*  at  tn  is  centered  on 
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x j , the  outcome  usually  being  that  upwind  differencing  is  considered 
superior  but  more  complicated  (because  of  the  search  implied  in  (6.1)) 
and  central  differencing  inferior  but  simpler  (no  search  needed).  Up- 
wind differencing,  it  is  said,  stays  closer  to  the  physics  contained  in 
the  convection  equation.  If  this  indeed  is  desirable,  one  should  be 
able  to  measure  the  benefit.  That,  apparently,  is  not  so  easy:  to 

date,  very  few  quantitative  theorems  have  been  proven  supporting  the  up- 
wind claim  to  a higher  accuracy. 

One  piece  of  evidence  can  be  found  in  [3]  where  Fromm's  [4]  "zero- 
average  phase-error"  scheme  (an  upwind-biased  scheme  of  second-order 
accuracy)  is  shown  to  yield  the  lowest  L^-error  in  convecting  a step 
function,  in  comparison  to  all  other  second-order  schemes  based  on  the 
same  data.  This  suggests  the  use  of  upwind  schemes  for  shock-propaga- 
tion problems,  an  area  of  application  in  which  these  schemes  indeed  are 
unrivalled  [5]. 

Another  quantitative  statement  was  presented  by  me  without  proof 
in  [6];  it  concerns  the  lack  of  numerical  dispersion  by  upwind  schemes 
at  which  Fromm  already  hinted.  This  will  be  the  subject  of  the  re- 
mainder of  the  paper. 
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2.  AN  OPERATIONAL  DEFINITION  OF  UPWINDING 


To  avoid  cluttering  up  the  formulas,!  shall  restrict  the  value  of 
the  CFL  number  to  the  interval  [0,1]. 

Definition . A scheme  for  Eq.  (1)  of  the  general  form 


n+1  V /~\  n 

uj  * i ck(0)Vk 

(7.1) 

is  called  upwind-biased  for  the  CFL -number  range  [0,1] 
cients  satisfy  the  symmetry  relation 

if  its  coeffi- 

V - °>  - 

(7.2) 

Eq.  (7.2)  does  not  imply  consistency  of  scheme  (7. 
for  this  we  need  to  impose  two  more  conditions: 

1)  with  Eq.  (1) ; 

l ck(a)  = l , 
k K 

(8.1) 

l kc  (a)  = -a. 

(8.2) 

A detailed  analysis  is  needed  to  find  the  condition  on  the  coefficients 
that  will  ensure  stability  of  the  scheme  for  all  values  of  0 in  the 
range  indicated. 


It  is  possible  to  make  scheme  (7.1)  yield  the  correct  translated 
initial-value  distribution  for  integer  values  of  G;  this  clearly  is 
useful.  The  additional  condition  needed  is 

ck(0)  =0,  k * 0.  (9) 

3.  NUMERICAL  DISPERSION  BY  UPWIND  SCHEMES 


When  updating  the  solution  with  a scheme  of  the  form  (7.1),  we 
generally  introduce  both  dispersive  and  dissipative  errors.  That  is, 
the  Fourier  components  of  the  initial-value  distribution  are  convected 
by  the  scheme  at  the  wrong  speed,  while  also  being  damped.  Only  for 
integer  values  of  O these  errors  can  be  avoided  simultaneously.  For 
non-integer  values  of  O all  consistent  stable  schemes  of  the  form  (7.1) 
must  be  dissipative,  since  they  are  not  invariant  under  time  reversal. 
With  upwind-biased  schemes  at  least  the  dispersion  may  be  avoided,  as 
shown  below. 


Lemma . For  any  scheme  that  is  upwind-biased  for  the  CFL -number 
range  [0,1],  the  result  of  one  step  with  CFL  number  a followed  by  a 
step  with  CFL  number  1 - O is  free  of  dispersion. 


Proof . Assume  initial  values  according  to 


n n iaj 

uj  - v 5 


do) 
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any  upwind-biased  scheme  with  CFL  number  O 6 [0,1]  may  then  be  written 


as 


n+1 
u . 

J 


g(a,a)u” 


(11.1) 


with  amplification  factor 


g(o,a) 


K-l 

l 


k=-K 


ck(a) 


iak 

e 


K > 1 . 


(11.2) 


The  same  scheme  applied  with  a CFL  number  1 - O has  an  amplification 
factor 


g(l-a,a) 


K-l 

1 


k=-K 


ck(l-a)e 


iak. 

9 


(12.1) 


by  virtue  of  (7.2)  we  have 


g(l-a,a) 


K-l 

l 


k=-K 


c-k-i(a)e 


iak 


(12.2) 


Introducing  Jl  = -k-l  leads  to 


g(l-c,a) 


K-l 

l 


£=-K 


c^(CJ)e 


-ia(£+l) 


-ia  1 . . -iaSL 

e l cp (a)e 

K 


= e iag*(c,a). 


(12.3) 
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The  composite  scheme,  with  a CFL  number  of  1,  has  an  amplification 
factor 


~icx 

g(l-a,a)g(a,a)  = e g*(a,a)g(a,a) 

= e"ia|g(a,a) |2,  (13.1) 

to  be  compared  to  the  amplification  factor  for  the  exact  solution  at  a 
CFL  number  of  1 : 


n+1  -ia  n 
u.  = e u. . 
J J 


The  two  factors  are  identical  in  phase.  □ 


The  above  lemma  has  an  interesting  consequence. 


03.2) 


Corollary.  An  upwind-biased  scheme  for  the  CFL-number  range  [0,1] 
has  no  dispersion  for  a CFL  number  of  ^ . 

Proof.  Apply  the  previous  lemma  to  the  case  a = j.  Since 
a = 1 -0  = 4-,  the  two  steps  with  the  upwind  scheme  have  the  same 
amplification  factor 


with  the  correct  phase  -a/2.  □ 


(14) 
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A geometric  interpretation  of  this  corollary  for  the  CIR  scheme  is 


arg  g(0,a)  is  called  \)j. 


5 


It  further  follows  that  for  any  value  of  a the  dispersive  error 
changes  sign  when  0 passes  through  0 (illustrated  for  the  CIR  scheme 
by  Figure  2),  while  the  damping  factor  |g(0,ot)|  goes  through  an  ex- 
tremum [6],  For  all  practical  schemes  this  extremum  is  an  absolute 
minimum.  Thus,  in  an  upwind-biased  scheme,  minimum  dispersion  and 
maximum  dissipation  go  hand  in  hand.  This,  again,  leads  to  the  repre- 
sentation of  moving  discontinuities  with  comparatively  little  ringing. 

Besides  upwind-biased  schemes  for  a CFL-number  range  of  the  type 
[m,m+l]  there  are  upwind-biased  schemes  for  the  range  [m-l,m+l].  These 
are  obtained  by  shifting  the  center  of  a central-difference  scheme  up- 
wind over  m meshes.  An  example  is  the  fully  one-sided,  second-order 
scheme  for  the  CFL-number  range  [-2,0], 

uj  1 = - f C1  - a)uj_2  + a(2  " a)uj-i  + -j  0 - a)  (2  - a)uj.  (15) 
The  coefficients  of  this  scheme  satisfy  the  relation 

ck(2  - a)  = c_k_2(0);  (16) 

Accordingly,  a step  with  CFL  number  0 should  be  followed  by  a step  with 
CFL  number  2 - 0 in  order  to  achieve  zero  dispersion  at  a net  CFL 
number  of  2. 

For  central-difference  schemes  the  corresponding  relation  is 


ck(-0)  = c_k(0); 


(17) 


hence,  annihilation  of  phase  errors  cannot  be  combined  with  a net  ad- 


vancement in  time. 


2 3 4 5 7 10  15  20  50  i/Ax  100 


Figure  2.  Velocity  dispersion  versus  wavelength  for  the  CFL  scheme 
The  wavelength  L is  related  to  a by  a = 2ttAx/£;  the 
ratio  of  computed  to  exact  convection  speed  is  evaluated 
as  ipl(-aa) . 
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ABSTRACT 

AZTEC  (Adaptive  Zoom  Tracking  - Experimental  Code)  is  a code  to  solve  the 
one-dimensional  gas  dynamic  equations  in  a variable  area  duct  with  specific  implemen- 
tation for  plane,  cylindrical,  and  spherical  geometries.  The  program  uses  a fixed,  locally 
and  adaptively  refinable  grid,  together  with  a set  of  moving  grid  points  which  migrate 
through  the  fixed  grid.  The  moving  points  represent  shocks  or  contact  discontinuities, 
and  they  can  be  created  or  destroyed,  usually  as  the  result  of  a collision.  Mass,  energy, 
and  momentum  (the  last  only  in  the  constant  area  case)  are  exactly  conserved,  except 
after  a collision;  in  that  case  the  conservation  error  is  reduced  to  invisible  levels  by  spa- 
tially localized  partial  time  stepping.  The  basic  difference  scheme  for  both  the  fixed  and 
moving  grid  is  Godunov’s  method,  with  the  Riemann  solver  used  to  compute  both  cell 
boundary  fluxes  and  the  speeds  of  the  moving  points.  Tracking  of  rarefaction  waves  on 
the  moving  grid  is  difficult  with  this  method  since  the  waves  must  be  represented  as 
piecewise  constant.  In  one  version  of  AZTEC  the  rarefaction  waves  are  recorded  on  the 
fixed  grid  with  the  Lax-Wendroff  difference  scheme  with  a small  additional  viscosity, 
and  most  of  the  numerical  experiments  have  been  performed  with  this  version.  In 
another  version  the  polytropic  gas  equation  of  state  has  been  replaced  by  one  in  which 
the  pressure  is  a continuous  piecewise  linear  function  of  specific  volume  at  constant 
entropy.  With  this  assumption  the  solution  of  each  Riemann  problem  is  piecewise  con- 
stant. and  our  method  is  exact  until  the  wave  structure  becomes  too  complicated.  Some 
preliminary  numerical  results  are  exhibited  for  this  version. 


• Sponsored  by  the  U.  S.  Department  of  Energy  under  contract  W-7405-ENG.36.  The  publisher  recog- 
nizes the  U.  S.  Government  retains  a nonexclusive,  royalty-free  license  to  publish  ot  reproduce  the  pub- 
lished form  of  this  contribution,  or  to  allow  others  to  do  so,  for  U.  S.  Government  purposes. 
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1.  AFTER  SOD. 


Sod  s survey  paper  [l]  was  a milestone  in  the  development  of  numerical  methods 
for  one  dimensional  gas  dynamics,  for  it  clearly  exposed  the  shortcomings  of  some 
methods  which  were  in  vogue  at  the  time.  It  seems  appropriate  to  point  out  that  two 
techniques  which  were  not  included  in  the  survey  are  adaptive  grid  refinement  and  the 
method  of  characteristics.  Proper  application  of  the  latter  requires  some  form  of  front 
tracking,  so  that  the  programming  of  both  methods  is  considerably  more  complicated 
than  for  shock  capturing  schemes. 

While  just  a modest  amount  of  localized  grid  refinement  will  improve  a shock  cap- 
turing method,  there  are  pitfalls.  We  refer  the  reader  to  [2],  The  method  of  characteris- 
tics has  two  interpretations.  In  the  first,  the  characteristic  curves  become  coordinate 
lines.  Since  there  are  three  characteristics  for  the  gas  dynamic  equations  , two  of  them 
must  be  chosen.  The  natural  choices  are  the  u+c  and  u-c  characteristics.  In  the  case  of 
isentropic  flow,  this  means  that  differencing  along  the  characteristics  requires  no  inter- 
polation. In  the  non  isentropic  case  values  on  the  third  characteristic  must  be  obtained 
by  interpolation. 

The  second  expression  of  the  method  of  characteristics  is  a form  of  upstream 
differencing.  The  idea  is  roughly  the  following.  Write  the  gas  dynamic  equations,  or 
any  hyperbolic  system,  in  the  form  w,  + Awx  = 0.  Let  lj  . j — 1.  • • • ji . be  the  left 
eigenvectors  of  A . with  eigenvalues  X } . Then 

lj(w,  + \jwx)  = 0.  (1.1) 

This  is  differenced  explicitly,  using  backward  spatial  differences  for  positive  \j  and  for- 
ward differences  for  negative  \j . More  precisely, 

IjCw?  +1  — w/1  + fij  (tv/1  — w£))  = 0 (1.2) 

where  fij  =\jAx/&t.  and  k =i  — 1 if  fij  >0.  k —i  +1  if  fij  <0.  If  there  are  discon- 
tinuities present,  they  must  be  tracked  through  the  grid  in  both  versions. 

AZTEC  combines  grid  refinement  and  tracking,  using  conservative  differencing. 
The  tracking  is  most  easily  done  with  Godunov’s  method,  using  moving  grid  points  to 
locate  the  discontinuities.  A condition  for  the  stability  of  Godunov's  method  is  that  the 
fluxes  on  the  cell  boundaries  remain  constant  during  a time  step.  We  found  that  the 
simplest  way  to  do  this  in  our  context  was  to  remove  fixed  grid  points  near  the  moving 
ones  by  locally  coarsening  the  spatial  grid.  This  is  inaccurate  if  the  moving  point  is  in  a 
region  with  spatial  variation,  but  we  counteract  that  with  a local  grid  refinement  which. 


as  described  later,  refines  in  both  space  and  time.  The  Riemann  solver,  which  provides 
the  fluxes  for  the  conservative  difference  equations  also  determines  the  speeds  of  the 
moving  points,  as  suggested  in  [3], 

In  section  2 we  give  details  of  the  grid  refinement  procedure.  In  section  3 we  dis- 
cuss the  moving  grid.  In  section  4 we  exhibit  the  result  of  some  computations.  In  sec- 
tion 5 we  present  some  preliminary  results  for  a piecewise  linear  equation  of  state. 


2.  GRID  REFINEMENT. 

The  one  dimensional  gas  dynamic  equations  for  a variable  area  duct  are 

(a  (x  )p),  + (a  (x  )pv  )x  = 0 (2.1) 

(a  (x  )pv  ),  + (a  (x  )(pv2  + p ))x  = pax 
(a  (x  )pE  ),  + (a  (x  )v  ( pE  +p  ))x  = 0, 

where  p is  the  mass  density,  v is  the  velocity,  E = e +(l/2)v2,  e is  internal  energy,  and 
p is  the  pressure  with  equation  of  state  p — p (p.e  ).  The  quantity  a (x  ) is  the  area 
function. 

Our  program  was  originally  written  for  slab  geometry.  It  was  pointed  out  to  us  by 
J.M.Hyman  that  an  easy  way  to  extend  a fixed  grid  slab  code  to  handle  variable  area  is 
to  introduce  area-weighted  variables.  Thus,  we  let 

w = (a  (x  )p,a  (x  )pv  a (x  )pE  Y 

so  that  the  equations  become 

w,  +(a/  (w  /a  ))x  = g , (2.2) 

where  / is  the  flux  vector  given  by 

/ = (pv  ,pv  2 + p ,v  (p£  + p )Y 

and 


g = (0  ,pax  .0  Y 

Suppose  that  we  have  a uniform  grid  of  N cells  indexed  by  i.  The  quantity  w“ 
will  be  the  average  of  w in  the  cell  at  time  n.  x;  is  the  coordinate  of  the  ceil  center; 
X,-  +i/2  is  the  coordinate  of  the  interface  between  cell  i and  i + 1.  The  area  at  a cell  edge 
is 
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°i+l  12  ~ a (*i  + 1/2)* 


but  the  area  of  the  cell  center  is  defined  by 

ai  — a (xj_i/2  ^i+1/2) 


where 


a (x  ,y  ) = 


If  a (s  )ds  (y  -x  )“* 


and 


a(x  j c)  = a(x). 

The  basic  conservative  difference  equation  is 

Ax  w?  +1  = Ax  w,"  - A t [(a/  ),  +V2  - ( af  ),  _1/2]+g  Ax  A t.  (2.3) 

If  the  cell  interface  with  index  i +1/2  is  internal . that  is,  if  cells  i and  t +1  are  both 
present  on  the  grid,  we  allow  two  possible  definitions  of  the  numerical  flux  /j+i/2  • 

The  Godunov  flux  is  obtained  by  solving  the  Riemann  problem  centered  at  x = 0 
with  left  state  given  by  Wj/oj  and  right  state  given  by  wi+1/ai+l.  The  flux  function 
evaluated  at  x = 0,  t >0  is  then  used  for  f i+  j/2. 

The  Lax  —Friedrichs  flux  is  defined  as  follows.  Set 

wi+ 1/2  = -5K-+1  + Wi  - At  / Ax  (/j+1  - f , )a(xi+  m )]. 

and  then 

/ i +1/2  = / (wi  +1/2/ ai  +1/2)- 

In  the  uniform  area  case  if  the  fluxes  at  both  cell  boundaries  are  Lax-Friedrichs  fluxes 
then  w?  +1  becomes  the  two-step  Lax-Wendroff  scheme. 

The  choice  of  fluxes  is  part  of  the  experimentation  with  AZTEC.  However,  an 
invariable  strategy  that  we  have  implemented  is  to  always  use  Godunov  fluxes  on  the 
finer  grids  (if  they  exist),  at  the  external  boundaries,  and  at  cells  in  a neighborhood  of  a 
moving  grid  point.  If  the  Lax-Friedrichs  flux  is  used  at  all  on  the  coarse  grid,  it  is  in  an 
expansion  region. 

The  grids  are  defined  in  terms  of  cells  rather  than  points.  The  symbol  j will 
always  identify  a grid  level,  j = 1,2,  • • • J . The  maximum  number  of  grid  levels,  J . is 
an  input  parameter.  Level  1 represents  the  coarsest  grid,  with  N(l)  = N cells  each  of 
length  Ax  (1)  = Ax . Level  2 is  a refinement  of  level  1 obtained  by  dividing  each  cell  of 


452 


level  1 in  half,  so  that  Ax  (2)  = .5  Ax  (l).  and  N(2)  = 2N(l).  Thus. 
Ax  O' ) = 2~^-1)Ax  , and  N(j)  = 2J~1N . 

Since  the  refinement  is  local  and  adaptive,  not  all  cells  on  every  level  will  be 
advanced  at  every  time  step.  There  are  two  kinds  of  cells,  live  and  dead . At  the  start 
of  a time  step  the  level  1 cells  are  all  live.  For  ; > 1.  a cell  on  level  j will  be  live  only 
if  its  parent  cell  on  level  j — 1 is  live  and  if  certain  tests  of  the  state  variables  on  level 
j —1  indicate  that  refinement  (splitting)  of  the  parent  cell  is  required. 

There  will  be  two  kinds  of  live  cells,  sterile  and  fertile . A sterile  cell  is  one 
which  is  not  to  be  split  and  which  therefore  must  be  advanced  by  the  difference  equa- 
tion. A fertile  cell  is  one  which  splits  into  two  daughter  cells  on  the  next  level  and 
which  is  therefore  not  advanced  by  the  differential  equation.  The  advancement  of  a 
sterile  cell  requires  the  computation  of  fluxes  at  the  cell  boundaries,  but  computation  of 
the  flux  at  a fertile  cell  boundary  will  be  needed  only  if  that  cell  is  not  contiguous  at 
that  boundary  to  a fertile  cell  on  the  same  level.  Since  AZTEC  is  designed  for  serial 
computation  we  have  tried  to  avoid  redundant  calculation  of  fluxes.  This  arrangement, 
which  is  not  quite  as  complicated  as  it  sounds,  is  shown  schematically  in  Figure  2.1. 


level 


Fig.  2.1.  Boundary  fluxes  at  grid  interfaces. 

The  boxes  represent  cells  on  the  indicated  grid  level.  The  vertical  sides  of  the  boxes 
represent  the  cell  boundaries.  A dotted  line  means  that  the  flux  is  not  computed  on  that 
grid  level.  The  flux  is  computed  at  a solid  line.  A solid  line  with  an  arrow  means  that 
the  flux  computed  at  that  grid  is  used  at  the  next  grid  level.  Thus,  at  an  interface 
between  grids  j and  j+ 1 . the  coarse  grid  flux  supplies  the  boundary  condition  for  the 
finer  grid.  This  generalization  of  [4]  enables  us  to  maintain  conservation  . 
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Since  our  difference  scheme  is  explicit,  the  time  step  for  level  j must  be  half  that 
for  level  j—  1.  An  example  of  the  evolution  of  the  space-time  grid  is  shown  in  Figure 
2.2. 


Ax 


Fig.  2.2.  Space-time  refinement. 

Note  that  refinement  has  occurred  between  the  coarse  time  steps.  Here  is  the  algorithm 
for  setting  up  and  advancing  the  grids. 

BEGIN  ALGORITHM 
j = l 

1 ic(j)  — 0 *ic( j)  is  0 for  the  first  pass  through  level  j,  1 for  the  second  pass* 

2 call  CREATE(j)  *Determine  and  label  the  fertile  and  sterile  cells  on  level  j,  label 
and  provide  data  for  the  live  cells  on  level  j+1.  Set  nc(j+l)  - number  of  live  cells  on 
level  J+1.* 

call  FLUX(j)  *Compute  and  store  fluxes  on  level  j at  those  interfaces  which  are  a 
boundary  of  at  least  one  sterile  cell.* 

call  ADVANCE(j)  * Compute  wn  +1  on  level  j and  overwrite  on  wn  , for  sterile  cells 

only* 

if  j < J and  ncCj+1)  & 0 

then  j - j+1  and  go  to  1 
3 else  if  ic(j)  ■ 0 
then  if  j - 1 

then  step  finished 
else  ic(j)  - 1 
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go  to  2 
else  j « j-1 

call  CONST(j)  * The  total  conserved  quantity  (mass,  for  example ) in  a 
fertile  parent  cell  is  defined  to  be  the  sum  of  the  conserved  quantities  of 
the  two  daughter  cells.* 
go  to  3 

END  ALGORITHM 

The  subroutine  CREATE  requires  further  discussion.  First,  it  must  determine 
which  cells  on  level  j are  to  be  split.  This  is  done  by  performing  two  tests  for  each  cell. 
If  l is  the  cell  index,  then  one  of  the  tests  looks  for  moving  grid  points  in  the  cells 
l —2.  l — 1,  l , I +1. 1 +2  (see  section  3).  If  there  are  any.  then  cell  l splits  into  two.  Of 
course,  special  provisions  have  to  be  made  for  cells  close  to  the  boundaries.  The  second 
test  splits  cell  l if  there  is  a compression  in  the  same  neighborhood  as  above:  other  cri- 
teria could  be  included.  Now.  suppose  that  in  advancing  from  t to  t + At  CREATE 
finds  that  a cell  on  level  j must  be  split  into  two  daughter  cells  on  level  j +1.  There  are 
two  possibilities:  the  daughter  cells  were  present  at  the  previous  time  step  and  were 
advanced  to  time  t by  the  algorithm,  or  they  were  not.  In  the  former  case  no  new  data 
need  be  created  for  the  daughters.  In  the  latter  case  data  is  obtained  by  interpolation.  If 
the  parent  cell  on  level  j has  index  i , the  interpolation  is  as  follows.  Let  L and  R be 
the  indices  of  the  left  and  right  daughter  cells,  respectively,  and  let 

wL  = 1.25Wj  - .25(aj/a)+1Vi+1. 

wR  = .75wj  +.25(flj  /aj+iVj+i 

If  both  wL/aL  and  wR /aR  lie  between  w^Ja,^  and  wi+1/ai+1,  accept  wL  and  wR  as 
the  interpolated  values.  If  not,  let 

wL  = .75wj  + .25(aj/a|_iVi.i 

wR  = 1.25wj  — 

Use  these  as  the  interpolated  values  unless  the  above  monotonicity  test  fails,  in  which 
case  set 

wL  = (ai/a.Vi, 
wR  = ( aR  la  -,  Vi  • 

The  latter  is  also  used  if  cell  i is  at  the  boundary  of  the  physical  domain. 
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3.  THE  MOVING  GRID. 


The  moving  grid  points  will  move  through  the  fixed  grid  and  exchange  data  with 
the  fixed  cells.  We  have  chosen  to  do  this  for  the  finest  grid  only:  this  is  arranged  by 
having  one  of  the  refinement  tests  look  for  moving  points  in  the  two  cells  on  each  side 
of  the  current  cell.  If  the  points  are  not  allowed  to  move  more  than  the  length  of  one 
cell  in  one  time  step,  they  cannot  leave  the  fine  grid. 

The  moving  gridpoints  define  boundaries  of  skewed  space  time  cells  in  which  the 
conservation  laws  are  applied  just  as  they  were  for  the  fixed  grid.  For  the  two  points 
x < y shown  in  Figure  3.1,  the  difference  equation  is 

[y  -X  +(c Ty  —<rx  )Af  ]vv*y  = (y  -X  )wxy  - At  [a  (y  ,y  +ay  At  )Fy  —a(xjc  +trx  At  )FX  ] 

+ g Af  . (3.1) 


A 


Fig.  3.1.  Space-time  cell  defined  by  moving  points. 

The  quantity  wxy  is  both  the  right  value  for  the  discontinuity  at  x and  the  left  value 
for  the  one  at  y . The  term  g At  only  appears  in  the  momentum  equation.  To  avoid 
false  accelerations  it  must  have  the  following  form.  If,  in  the  momentum  equation, 

Fy  = pvy2  + Py . 

and  similarly  for  Fx  , then 

g = (1/2 )(px  + py  )[a  (y  ,y  + cr,,  At  ) - a (x  pc  +trx  At  )]. 

There  are  two  things  that  must  be  provided  in  the  basic  difference  equation  above: 
the  speeds  <r  and  the  fluxes  F . These  are  obtained  from  the  solution  of  Riemann  prob- 
lems. In  order  to  do  this  we  must  first  recover  the  hydrodynamic  variables  from  the 
area  weighted  variables.  Suppose,  for  the  moment,  that  we  have  done  this  properly. 
Then  when  the  grid  point  at  position  x is  to  be  moved  there  will  be  associated  with  it  a 
left  state  u_  and  a right  state  u+.  We  find  the  complete  solution  of  the  Riemann 


problem  for  these  two  states.  Then  we  decide  which  ray  or  rays  are  to  be  followed.  For 
example,  if  the  point  x is  a contact  discontinuity  and  if  the  solution  of  the  Riemann 
problem  has  a sufficiently  strong  contact  discontinuity,  then  we  take  the  new  speed  to 
be  the  speed  of  the  contact.  The  new  flux  F is  / — cru  evaluated  on  this  ray  (this  takes 
account  of  the  fact  that  the  ray  is  not  necessarily  vertical  in  the  space-time  plane).  Note 
that  / — cru  is  continuous  across  every  ray  in  the  solution  of  the  Riemann  problem. 
More  generally,  the  point  x might  spawn  several  new  moving  points.  If  x is  the  result 
of  a collision  with  another  point  or  with  a reflecting  boundary,  then  we  could  follow  all 
the  shocks  and  contacts  which  emerge. 

The  complete  logic  of  the  procedure  for  deciding  which  rays  to  keep  is  too  compli- 
cated to  give  in  complete  detail  here,  but  we  can  give  an  outline  of  it.  First,  the  Riemann 
solver  produces  a list  of  speeds  and  fluxes  and  identifiers  for  each  sufficiently  strong 
wave  which  is  present  in  the  solution.  Thus,  a shock  corresponding  to  the  characteristic 
v +c  is  identified  as  a 3-shock,  and  a speed  and  flux  are  given  for  it.  A rarefaction 
corresponding  to  the  characteristic  v — c is  identified  as  a 1-wave,  and  for  it  the  speeds 
and  fluxes  on  the  leading  and  trailing  edges  are  provided.  Next,  tactical  decisions  are 
made  in  a subroutine  called  TRACK,  which  has  the  job  of  creating  and  destroying  mov- 
ing points,  advancing  the  moving  points  and  checking  for  collisions,  maintaining  stabil- 
ity on  the  moving  grid,  and  communicating  with  the  most  refined  portions  of  the  fixed 
grid. 

Here  is  how  TRACK  works.  First,  the  points  are  collected  into  blocks.  Each  block 
is  such  that  the  rightmost  point  of  one  block  is  separated  by  five  or  more  full  fixed  cells 
from  the  leftmost  point  of  the  next  block,  as  in  Figure  3.2. 


Fixed  cell 
boundaries 


V 

v-  Moving  points 
in  one  block 


T. 

Mnvi 


Moving  points 
in  next  block 


Fig.  3.2.  Blocks  of  moving  points. 


Each  block  is  processed  independently  of  the  others,  so  what  follows  refers  to  the 
points  of  one  block.  In  order  to  improve  resolution  in  the  variable  area  case,  if  two 
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adjacent  points  are  two  or  more  full  cells  apart  some  fixed  grid  points  between  the  pair 
are  treated  as  moving.  These  are  called  separator  waves.  Next,  each  moving  point  is  pro- 
vided with  a left  and  right  average  value  of  the  area-weighted  variable  w so  that  if 
w_(x  ) and  w+(x  ) are  respectively  the  left  and  right  states  of  x . and  if  x and  y are 
adjacent  points  (x  < y ) then  w +(x  ) = w_(y ).  This  is  done  using  a combination  of  fixed 
cell  data  and  moving  point  data  obtained  from  eq.  (3.1)  for  the  previous  time  step, 
depending  on  the  separation  of  the  points.  Of  course,  this  is  done  conservatively.  The 
hydrodynamic  variable  corresponding  to  w+(x  ) is  u+(x  ) :=  >v+(x  )/a  (x  ,y  ).  Now  the 
Riemann  solver  is  called  for  each  point  in  the  block.  For  the  typical  grid  point  all  the 
rays  returned  by  the  solver  are  assumed  to  define  new  points  which  are  inserted  into 
the  list  of  moving  points.  There  are  exceptions  to  this;  for  example,  at  a left  reflecting 
boundary  only  rays  with  non-negative  speeds  are  retained.  The  list  is  ordered  by  posi- 
tion if  the  positions  are  unequal  and  by  speed  otherwise,  as  in  Figure  3.3. 


Fig.  3.3.  Ordering  of  moving  points. 

At  this  stage  we  have  many  more  points  than  we  want  or  need,  but  most  of  them  will 
be  deleted  at  the  end  of  the  time  step. 

The  reason  for  retaining  so  much  information  is  that  this  gives  us  a procedure  for 
maintaining  stability  during  a collision  or  close  approach  of  moving  points.  If  a collision 
occurs  at  time  t + St , 0 < St  < A t . the  current  block  of  points  is  advanced  to  t +8 1 
using  eq.  (3.1)  with  At  replaced  by  St.  Then  we  attempt  to  finish  the  time  step  by 
advancing  from  t + St  to  t + At , checking  again  for  a collision,  etc.  The  use  of  blocks 
causes  this  partial  time-stepping  to  be  spatially  localized,  unless  the  moving  grid  is 
evenly  distributed  in  the  fixed  grid.  The  idea  now  is  that  any  collision  which  occurs  at 
this  time  is  "exact."  which  means  the  following.  If  the  points  x and  y in  Figure  3.1 


collide  at  time  t + Af . then  x + cr*  A t — y + <xy  At . so  that  the  left  side  of  eq.(3.1)  is 
zero.  On  the  other  hand,  even  if  the  source  term  were  not  present,  the  right  side  of  that 
equation  cannot  be  expected  to  be  zero.  Indeed,  consider  the  case  shown  in  Figure  3.4. 


z 


In  this  situation,  a contact  and  a 1-shock  have  arrived  at  x and  y . respectively.  The 
Riemann  solver  has  produced  at  x a contact  with  speed  s and  a 3-rarefaction  wave, 
while  at  y the  solution  is  a 1-shock  and  some  other  waves  that  play  no  role.  Two  bad 
things  happen  if  we  suppress  the  rarefaction  wave.  First,  because  the  solution  is  not 
constant  along  the  ray  yz  we  can  expect  an  instability  to  develop.  Second,  making  the 
appropriate  substitutions  into  the  right  side  of  eq.(3.l),  we  have 

R i :=  (y  — x )a  (x  ,y  )u0-  Sr  [a  (x  a.  )su  j - a (y  x )cr,  u0] 

— St  [a  (y  (u0)  — a(x  j )/  (u  j)]. 

Even  if  the  area  factors  were  constant,  this  would  not  be  zero  unless  Uj  = u0-  On  the 
other  hand,  if  we  include  the  leading  edge  of  the  rarefaction  in  the  list  of  moving 
points,  then  the  first  collision  occurs  at  c.  Then  the  right  side  of  eq.(3.l)  becomes 

R2:~  8tf  (u0)[a  (y  O'  +<Ty  8t)-a(x  jc  +<JX  St )]. 

In  the  constant  area  case.  R2  — 0,  hence  the  nomenclature  exact.  In  other  words,  an 
exact  collision  is  one  in  which  the  state  between  the  two  intersecting  rays  is  constant. 
For  such  a collision  eq.(3.1)  is  identically  correct  if  the  area  is  constant.  When  the  area 
is  variable,  the  error  in  the  mass  and  energy  conservation  is  second  order  in  the  mesh 
size. 
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In  Figure  3.5  we  can  see  how  the  collision  between  the  contact  and  the  shock  will 
actually  occur. 


After  several  partial  time  steps,  caused  by  collision  of  the  precursor  rarefaction  wave 
with  the  shock,  the  rarefaction  wave  will  become  too  weak  to  be  seen  by  the  Riemann 
solver  and  the  main  collision  will  take  place.  There  will  be  a small  error  in  a conserved 
variable  such  as  the  mass.  The  program  controls  this  error  by  two  devices.  If  the  error 
exceeds  a pre-set  value  the  time  step  is  repeated  with  a smaller  strength  threshold  in  the 
Riemann  solver.  This  works  well  for  constant  area,  but  is  not  enough  in  other 
geometries.  For  them,  we  must  force  additional  partial  time  steps  that  will  reduce  R 2- 

At  the  end  of  the  time  step  (partial  or  complete)  the  precursor  waves  are  deleted. 
At  the  end  of  a full  time  step  the  separator  waves  are  also  removed.  Thus  each  step 
starts  fresh  with  the  main  moving  points.  However,  if  a major  collision  has  occurred, 
points  may  have  been  created  or  destroyed.  If  we  wish  to  keep  track  of  all  shocks  and 
contacts,  then  we  must  include  the  resulting  transmitted  and  reflected  shocks  and  resi- 
dual contact  produced  by  a collision  of  two  shocks  or  of  a shock  and  a contact.  The 
entire  process  that  we  have  described  works  remarkably  well,  particularly  if  collisions 
are  rare. 


4.  THE  TEST  PROBLEMS. 

Three  test  problems  are  presented.  The  first  is  Sod’s  problem  with  reflecting  boun- 
daries [l].  The  second  is  a problem  posed  by  Paul  Woodward  [5]  involving  the  interac- 
tion of  the  solution  of  two  Riemann  problems.  The  third  is  an  elegant  spherical  shock 


problem  with  a simple  exact  solution  due  to  Bill  Noh  [6]. 

In  Figure  4.1  we  give  our  solution  (density  only)  of  Sod’s  problem  at  t -.175.  The 
initial  data  define  a Riemann  problem  centered  at  x - .5.  The  left  state  has  density  1.0, 
pressure  1.0,  and  velocity  0.0.  The  right  state  has  density  .125,  pressure  .1.  and  velocity 
0.0.  The  equation  of  state  is  that  for  a y - law  gas  with  y * 1.4.  This  initial-value  prob- 
lem resolves  into  a rarefaction  wave,  a contact  discontinuity,  and  a shock  wave  (from 
left  to  right). 

1.0 
0.8 
0.6 
0.4 
0.2 
0.0 

0.0  0.2  0.4  0.6  0.8  1.0 

x 

Fig.  4.1  Sod’s  problem  at  t - .175  . 

The  shock  is  correct,  but  the  state  between  the  contact  and  the  rarefaction  is  in 
error  by  5%.  This  is  caused  by  the  presence  of  the  strong  rarefaction  in  close  proximity 
to  the  contact  early  in  the  calculation. 

In  Figure  4.2  we  give  the  apparently  converged  computed  solution  at  t - .81.  By 
this  time  the  main  shock  has  reflected  off  the  right  boundary  and  interacted  with  the 
contact,  producing  reflected  and  transmitted  shocks.  The  rarefaction  has  reflected  off 
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the  left  boundary,  begun  to  emerge  from  the  interaction  with  its  image,  and  is  just  now 
beginning  to  interact  with  the  main  pair  of  reflected/transmitted  shocks. 


Fig.  4.2  Sod’s  problem  at  t — .81  . 


The  initial  conditions  for  Woodward’s  problem  are  a gas  at  rest  with  unit  density 
in  a unit  interval  with  reflecting  walls.  The  pressure  in  the  left-most  1/ 10-th  of  the 
interval  is  1000  and  the  pressure  in  the  right-most  1/10-th  is  100:  it  is  .01  otherwise. 
The  initial  rarefaction  waves  moving  toward  the  boundaries  reflect  and  quickly  catch 
up  to  the  contacts  and  the  shocks.  The  collision  of  the  shocks  and  their  trailing  waves 
at  about  t — .028  initiates  a complex  sequence  of  intense  interactions  localized  within 
five  to  twenty  percent  of  the  interval.  The  computed  density  is  shown  in  Figure  4.3  at 
t = .038  . Woodward  has  computed  this  with  a very  fine  grid,  but  he  only  gives  a 
graph  of  the  solution.  We  differ  from  his  solution  only  in  the  magnitude  of  the  peak 
density,  which  he  finds  to  be  6.5  while  ours  is  7.  Both  calculations  locate  the 


discontinuities  in  the  same  places. 


Fig.  4.3.  Woodward's  problem  at  t = .038  . 


For  Noh’s  problem  we  have  a sphere  of  unit  radius  filled  with  a y-law  gas,  y - 
5/3.  at  zero  pressure  and  internal  energy,  and  with  velocity  - -1.  At  t = .6  the  solution 
consists  of  a shock  located  at  z - .2  moving  with  speed  1/3.  Behind  the  shock  the  pres- 
sure is  64/3  and  the  density  is  64.  Ahead  of  the  shock  the  density  is  1 + 1 /r2.  The  com- 
puted density  is  given  in  Figure  4.4. 


Density 


Fig.  4.4.  Noh’s  problem. 


5.  THE  PIECEWISE  LINEAR  EQUATION  OF  STATE. 

The  approximation  of  arbitrary  functions  by  piecewise  linear  ones  has  a long  and 
distinguished  history.  The  value  of  this  approximation  in  the  theory  of  conservation 
laws  seems  to  have  been  first  recognized  by  Dafermos  [7],  who  combined  a piecewise 
linear  flux  function  and  piecewise  constant  initial  data  to  obtain  an  elegant  existence 
theorem  for  scalar  conservation  laws.  The  crucial  property  of  the  piecewise  linear  scalar 
flux  is  that  the  solution  of  the  Riemann  problem  has  only  coni  cant  states.  Hedstrom  [8] 
observed  that  if  the  pressure  expressed  as  a function  of  specific  volume  and  entropy  is 
piecewise  linear  in  the  volume,  then  again  the  solution  of  the  Riemann  problem  has  only 
constant  states.  Hedstrom  used  this  as  a computational  device  to  obtain  numerical  solu- 
tions of  the  equations  of  isentropic  flow,  by  tracking  the  shock-like  boundaries  of  the 
constant  states.  In  principle.  AZTEC  can  obtain  the  exact  solution  of  the  full  gas 
dynamic  equations  with  such  a piecewise  linear  pressure  and  piecewise  constant  initial 
data  simply  by  having  no  fixed  grid  points,  only  moving  ones.  If  we  also  take  a very 
large  time  step,  then  the  collisions  determine  the  intermediate  time  steps.  Each  collision 
will  be  exact  in  the  sense  defined  in  section  3. 

In  Figure  5.1  we  show  the  solution  of  Sod's  problem  for  a piecewise  linear  approx- 
imation to  the  y - law  gas. 


Fig.  5.1.  Sod’s  problem  for  a piecewise  linear  equation  of  state,  t = .175  . 


There  are  80  nodes  per  decade  in  density.  The  shock  and  contact  are  now  exact  (for  the 
given  equation  of  state),  and  the  rarefaction  wave  has  become  piecewise  constant.  Of 
course,  this  is  a trivial  application  of  the  method  as  there  have  been  no  collisions. 

In  Figure  5.2  we  have  a more  interesting  example,  namely.  Sod's  problem  with 
reflecting  boundaries  at  t=.8l,  computed  with  80  nodes  per  decade  in  density.  Now  we 
have  a rarefaction  wave  reflected  off  the  left  boundary  and  interacting  with  the  waves 
reflected  from  the  other  boundary.  This  result  should  be  compared  with  Fig.  4.2.  The 
solution  in  Fig.  5.2  was  obtained  about  30  times  faster  than  the  one  in  Fig.  4.2. 


Fig.  5.2.  Sod’s  problem,  t = .81  . piecewise  linear  equation  of  state. 

There  are  serious  difficulties  with  the  piecewise  linear  method  which  seem  to 
prevent  it  from  being  more  than  a curiosity.  One  problem  is  that,  in  general,  a collision 
of  two  waves  will  produce  at  least  three  outgoing  waves,  leading  to  a rapid  prolifera- 
tion of  waves  and  collisions.  This  can  even  happen  with  the  interaction  of  rarefaction 
waves,  such  as  occurs  in  the  reflecting  Sod  problem.  In  the  case  of  the  interaction  of 
rarefaction  waves,  this  difficulty  is  overcome  by  the  following  device.  Suppose  the  pres- 
sure p at  fixed  entropy  is  continuous,  and  that  it  is  linear  in  the  intervals  (Tj.Tj+i), 
i — 1.2  • • • ji . where  r = 1/p.  We  constrain  the  nodal  values  p(  to  satisfy  the  condi- 
tion that  ( Pi  — Pi  +i)(Tj  +1  — r,  ) is  a constant  that  depends  only  on  the  entropy,  not  i . It 
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follows  from  this  that  the  velocity  jump  across  each  internal  ray  in  a rarefaction  fan  is 
constant.  Moreover,  an  examination  (as  in  [8])  of  the  rarefaction  curves  in  the 
pressure-velocity  space  shows  that  the  number  of  collisions  in  the  complete  interaction 
of  two  rarefaction  fans  is  now  of  order  n2  if  there  are  n nodal  pressure  values  in  the 
fans.  This  constraint  was  used  to  generate  Figures  5.1  and  5.2. 

We  anticipate  reporting  additional  detail  on  our  experiences  with  piecewise  linear 
equations  of  state. 
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ABSTRACT 

Finite  difference  approximation  of  transonic  flow  problems  is  a well- 
developed  and  largely  successful  approach.  Nevertheless,  there  is  still  a 
real  need  to  develop  finite  element  methods  for  applications  arising  from 
fluid-structure  interactions  and  problems  with  complicated  boundaries.  In 
this  paper  we  introduce  a least  squares  based  finite  element  scheme.  It  is 
shown  that,  if  suitably  formulated,  such  an  approach  can  lead  to  physically 
meaningful  results.  Bottlenecks  that  arise  from  such  schemes  are  also 
discussed. 


Research  was  supported  in  part  by  the  National  Aeronautics  and  Space  . 
Administration  under  NASA  Contract  Nos.  NAS1-17070  and  NAS1-18107  while  the 
second  author  was  in  residence  at  the  Institute  for  Computer  Applications  in 
Science  and  Engineering,  NASA  Lanley  Research  Center,  Hampton,  VA  23665-5225. 
Partial  support  was  also  provided  by  Army  Research  Office  under  Contract  No. 
DAAG29-83-K0084. 


467 


1 . INTRODUCTION 


In  this  paper  we  consider  the  approximation  of  transonic  flows  by  finite 
element  methods  based  on  a variational  method  of  the  least  squares  type.  The 
objective  here  is  purely  computational.  In  particular,  we  have  sought  to 
fully  exploit  the  ideas  arising  from  mathematical  analysis  of  such  methods 
(see,  for  example,  [1]  - [6])  and  directly  apply  them  to  a nontrivial 
transonic  flow  problem.  The  major  conclusion  drawn  from  this  work  is  that 
finite  element  methods — suitably  formulated — can  give  physically  meaningful 
results. 

There  is  a significant  and  largely  successful  array  of  finite  difference 
techniques  for  transonic  flows  (e.g.,  [17]).  Nevertheless,  an  assumption 
implicit  in  this  work  is  that  there  is  still  a need  for  stable  and  accurate 
finite  element  approaches.  First,  there  are  applications  from  fluid-structure 
interactions  that  would  benefit  from  the  availability  of  a finite  element  flow 
model.  Second,  there  is  the  issue  of  complicated  boundaries  in  the  flow 
field.  The  importance  of  the  finite  element  ideas  in  such  a context — while 
largely  untested — is  still  promising. 

Variational  principles  of  the  least  squares  types  have  a number  of 
valuable  computational  properties.  For  example,  the  algebraic  system 
generated  is  always  Herraitian  seraidef inite.  In  addition,  such  schemes,  if 
properly  formulated,  are  insensitive  to  equation  type,  be  it  hyperbolic 
(supersonic  flows)  or  elliptic  (subsonic  flows).  In  fact,  the  majority  of  the 
finite  element  ideas  that  have  been  used  for  hyperbolic  problems  to  date  tend 
to  be  either  implicitly  or  explicitly  of  the  least  squares  type. 

Least  squares  based  schemes  do  have,  however,  some  major  computational 
defects.  First,  they  tend  to  be  sensitive  to  singularities  and 
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discontinuities  in  the  flow  variables.  Moreover,  mesh  refinement  alone  does 
not  overcome  these  defects  [7].  Based  on  the  work  in  [7]  we  introduce 
weighted  least  squares  variational  principles,  which  in  combination  with  mesh 
refinement  is  capable  of  dealing  with  shocks  in  the  flow  field. 

In  Section  2 we  describe  the  basic  numerical  formulation,  and  outline  the 
essential  computational  properties  associated  with  the  approach.  A key 
feature  is  the  proper  choice  of  weighting  functions  to  use  in  the  least 
squares  functional.  A closely  allied  issue  is  the  density  modifications 
needed  to  rule  out  nonphysical  expansion  shocks. 

In  Section  3 we  present  sample  numerical  results.  As  a model  problem  we 
select  the  planar  potential  flow  over  a cylinder. 

Other  authors  have  considered  finite  element  approximation  of  transonic 
flows.  Selected  references  are  [18]  - [21]. 


2.  THE  LEAST  SQUARES  FORMULATION 

A 

We  consider  the  potential  flow  over  a body  0.  Let  u^  denote  the 
velocity  and  p the  density.  Then  a mass  balance  yields 


In  addition,  we  have 


div[puj  = 0. 


u = grad  <|> 


(2.1) 


(2.2) 


for  the  velocity  potential  <(>.  The  density  p is  given  as  a function  of  u 
by  the  Bernoulli  equation.  The  system  is  closed  by  specifying  the  normal 
velocity 
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u»n  = v (2.3) 

a 

at  the  boundaries  of  the  flow  region.  On  the  body  ft  the  no  flow  condition 

ii*ii  = 0 

applies.  We  assume  that  the  flow  region  is  contained  in  a box  B and  that 
(2.3)  is  specified  on  the  boundary  of  B.  Thus 

A 

ft  = B/ft  (2.4) 

defines  the  flow  region,  and  (2.1)  - (2.2)  hold  in  ft  with  (2.3)  holding  on 
the  boundary  Y and  ft. 

Since  the  flow  is  assumed  to  be  irrotational , (2.1)  - (2.2)  can  be 

replaced  with 

div(pu)  = 0 in  ft  (2.5) 

curl(u)  =0inft  (2.6) 

u*n  = v on  T.  (2.7) 

A least  squares  scheme  based  on  this  system  takes  the  form 

/ {|div(p_u)|2  + | curl (u.)  | ^}  = min,  (2.8) 

ft 

where  the  variation  is  taken  u in  some  finite  element  space  satisfying  the 
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boundary  conditions  (2.7).  Such  a div  — curl  system  has  proven  to  be  very 
effective  for  elliptic  systems  (subsonic  flows)  in  cases  where  the  density 
p = p(u)  and  the  velocity  field  u are  smooth  [8], 

Preliminary  results  indicate  that  with  appropriate  weighting  functions  on 
the  terms  in  (2.8),  the  nonsmooth  cases  can  be  treated  as  well.  Nevertheless, 
in  this  paper  we  shall  focus  attention  on  (2.1)  - (2.2)  and  least  squares 
schemes  of  the  form 


= min,  (2.9) 

where  v.  “ Pn.  is  the  mass  flow  and  w is  a weighting  function  to  be  chosen. 

In  this  setup  the  variables  are  the  potential  <(>  and  the  mass  flow  v. 

The  density  in  (2.9) 

. P ■ p(  | grad  <(>|) 

is  obtained  from  Bernoulli's  equation,  i.e., 

pY  1 = {i  - M*  (|  grad  <J>  | 2 - l) 

Thus,  (2.9)  is  a nonlinear  least  squares  formulation,  which  is  appropriate 

since  it  reflects  the  nonlinear  character  of  transonic  flow.  Once  a grid  is 

selected  (specific  examples  are  given  in  the  next  section),  the  minimization 
of  (2.9)  over  the  associated  finite  element  space  leads  to  a nonlinear  system 

K($_)$_  = F.  (2.10) 


/ 

n 


- - grad  <f> 


+ w| div  v| 
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In  all  of  the  numerical  examples  reported  in  the  next  section,  (2.10)  was 
solved  by  a combination  of  Newton^s  method  and  elimination.  Issues  related  to 
this  choice  for  the  equation  solver  will  be  discussed  in  the  next  section. 

There  are  three  main  cases  that  are  considered  in  this  paper: 

Case  1 : smooth  subsonic  flows, 

Case  2 : smooth  transonic  flows , 

Case  3:  transonic  flows  with  shocks. 

In  the  first  case  (2.9)  can  be  used  without  modification,  and  in 
particular  no  weighting  function  is  needed  (i.e.,  w e 1 can  be  used).  One 
does  need  special  grids  to  obtain  optimal  accuracy  (see  [1]),  and  the  criss- 
cross grid  pattern  which  satisfies  the  grid  decomposition  property  of  [1]  is 
used. 

In  the  second  case  a hyperbolic  region  appears  but  the  flow  field  remains 
smooth.  In  this  case  there  is  a loss  of  accuracy  in  the  hyperbolic  region. 
In  particular,  with  linear  elements  the  pointwise  accuracy  in  the  mass  flow  j/ 
drops  from  O(h^) — in  a generic  mesh  spacing — to  0(h).  This  can  be  corrected 
with  a suitable  choice  of  weighting  function  w,  and  details  are  given  in  [8]. 
This  modification  was  not  used  in  the  results  reported  in  this  paper  since  the 
hyperbolic  regions  in  question  were  too  small  for  the  suboptimal  accuracy  to 
have  a major  effect  on  the  qualitative  features  of  the  flow. 

The  third  case  is,  by  a wide  margin,  the  most  important  as  well  as  the 
most  challenging.  Here  we; use  a weight  w so  that  the  terra 


J 

n 


( , 1 2 

u 

<Jw|div  u | + 

- “ grad  <j> 

( 

P 

(2.11) 
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remains  meaningful.  In  addition,  modification  to  the  density  p = p(|grad  <j>  | ) 
must  be  introduced  so  that  nonphysical  expansion  shocks  are  eliminated. 

For  the  choice  of  the  weight  w,  we  follow  the  developments  introduced  in 
[7].  For  most  flows,  = pji  is  continuous  across  the  shock  [10].  Neverthe- 
less, it  does  not  follow  that  div  _v  is  square  integrable,  and  the  primary 
rule  derived  from  [7]  is  that  w be  chosen  so  that 

/ w|div  v|2  < ».  (2.12) 

n 

This  requires  that  w vanishes  appropriately  on  the  shock,  which  in  turn 
means  that  (2.11)  is  a least  squares  principle  in  a degenerate  \r  norm.  A 
point  of  significance,  on  the  other  hand,  is  the  fact  that  if  w vanishes  to 
minimal  order  on  the  shock  (in  that  (2.12)  still  holds),  then  optimal  0(h2) 
can  be  achieved  In  unweighted  L norms  provided  appropriate  mesh  refinement 
Is  introduced.  This  has  been  proved  rigorously  only  in  special  cases  (see 
[7]),  yet  the  numerical  results  In  the  next  section  seem  to  indicate  that  the 
principle  is  general. 

These  modifications  alone  do  not  yield  an  accurate  simulation  of  the  flow 
problem.  To  do  this  one  must  deal  with  the  presence  of  nonphysical  expansion 
shocks.  In  effect,  (2.9)  does  not  have  a unique  minimum,  neither  over 
infinite-dimensional  function  spaces  nor  over  the  finite-dimensional  finite- 
element  spaces.  One  can  have  expansion  shocks,  compression  shocks,  or  both. 
What  is  interesting  is  the  results  In  the  next  section  tend  to  Indicate  that 
the  case  where  both  type  of  shocks  appear  tends  to  be  the  stable  mode  for 
(2.10).  That  is,  an  arbitrary  choice  of  starting  vector  for  Newton's  methods 
applied  to  (2.10)  tends  to  converge  to  this  solution. 
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To  eliminate  expansion  shocks  we  consider  density  biasing  which  in  effect 
introduces  streamwise  diffusion  into  (2.1)  - (2.2).  Following  [11]  (see  also 
[12]  - [14])  the  modified  density  takes  the  form 

P = P “ yPs  4s,  (2.13) 

where  p is  the  derivative  of  the  density  p along  the  streamwise 
s 

direction*  Since  the  density  has  the  form 

p = p(  | grad  <|>|), 

the  derivative  pg  formally  involves  second  derivatives  of  <f>.  Since  4>  is 
expanded  in  terms  of  linear  elements,  it  is  necessary  to  replace  pg  with  a 
streamwise  difference  quotient;  i*e*, 

P = P - y AP  As,  (2.13') 

in  the  least  squares  formulation. 

3.  NUMERICAL  RESULTS 

To  illustrate  the  above  ideas  we  selected  the  classic  problem  of  a planar 
flow  past  a cylinder.  The  flow  region  plus  boundary  conditions  are  given  in 
Figure  3.1.  The  configuration  shown  in  this  figure  assumes  that  both  the 

outflow  and  inflow  remain  subsonic.  Figure  3.2  contains  a typical  grid.  For 
economy  only  the  top  part  of  the  flow  region  is  shown,  and  the  special 
refinement  needed  for  the  shocks  is  not  shown. 
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The  first  set  of  results  shows  a typical  subsonic  flow  pattern 


The 


results  are  given  in  Figure  3.3  for  a free  stream  Mach  number  of 


M = 0.1. 

00 


Convergence  studies  at  such  Mach  numbers  are  reported  in  [5]  - [6].  These 

results  indicate,  with  the  type  of  grid  shown  in  Figure  3.2,  one  can  readily 
o 

achieve  L error  of  1%  or  less  for  the  velocity  field. 

The  next  set  of  results  deal  with  the  smooth  transonic  case.  Of  special 
interest  here  is  the  ability  of  the  scheme  to  detect  the  onset  of  supersonic 
flow.  Analytical  techniques  (see  [15]  and  [16])  have  given  accurate  values 

for  the  critical  free  stream  Mach  number  M*  as  a function  of  d/D,  where 
d is  the  diameter  of  the  cylinder  and  D is  the  width  of  the  channel.  These 
results  are  reproduced  in  Figure  3.4.  Numerical  results  from  the  least 
squares  scheme  are  given  in  Figures  3.5  - 3.7  for  Mw  « .42,  .45,  and  .50, 

respectively.  The  d/D  ratio  used  for  this  case  is  1/6.  Extrapolation 

based  on  these  results  indicates  that  the  critical  Mach  number  is 
approximately  .41,  which  is  good  agreement  with  Figure  3.4. 

The  next  set  of  results  show  what  least  squares  based  schemes  produce  when 
diffusion  via  density  modification  is  not  used.  These  are  shown  in  Figure  3.8 
which  contains  plots  of  the  velocity  q **  |uj  versus  angle  0 along  the 
cylinder  and  at  a radius  slightly  above  the  cylinder.  The  free  stream  Mach 

number  is  M^  *»  .5.  The  shock  at  the  front  of  the  cylinder  is  an  expansion 

shock  and  is  nonphysical.  The  one  at  the  rear  is  a compression  shock.  A 
remarkable  feature  of  this  approximation  is  that  the  physically  relevant 
compression  shock  is  approximately  in  its  correct  position  and  is  apparently 
unaffected  by % the  spurious  shock.  (Compare  Figures  3.8  and  3.9.) 
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The  solution  shown  in  Figure  3,8  is  apparently  a stable  mode  for  the 
nonlinear  system  (2,10),  Indeed,  Newton's  method  converged  to  this  solution 
rather  rapidly  for  a wide  variety  of  initial  conditions. 

In  this  regard,  it  is  interesting  to  note  that  for  the  least  squares 
formulation  the  Jacobian  is  not  singular  near  the  solution  shown  in  Figure 
3.8.  Density  modifications  are  needed  to  remove  the  spurious  shock  shown  at 
the  front  of  the  cylinder.  However,  they  are  not  needed  to  obtain  nonsingular 
Jacobians. 

The  final  results  deal  with  the  complete  least  squares  system  with  the 
density  modification  discussed  in  the  previous  section.  Figures  3.9  - 3.11 
show  the  velocity  field  over  the  cylinder,  at  a radius  slightly  larger  that 
that  of  the  cylinder,  and  at  a radius  in  the  free  stream.  Note  that  the 
spurious  expansion  shock  has  been  totally  eliminated.  Moreover,  the  shock 
location  and  strength  as  well  as  the  velocity  profile  appear  to  be  correct  as 
is  the  supersonic  bubble  shown  in  Figure  3.12. 

While  we  regard  these  numerical  experiments  as  successful,  there  are  a 
number  of  areas  where  the  approach  could  be  improved.  The  first  issue 
concerns  the  equation  solver.  Once  the  density  modification  were  introduced, 
the  number  of  iterations  increased  by  a factor  of  2 to  3.  Moreover,  the 
solution  shown  in  Figure  3.9  tended  to  be  less  "attractive"  to  the  Newton 
Iterations  than  that  shown  in  Figure  3.8  (without  density  modifications).  In 
fact,  it  was  not  difficult  to  find  starting  vectors  where  nonconvergence  was 
seen,  in  the  former  case,  although  the  starting  state  of  a uniform  flow  always 
leads  to  convergence.  This  suggests  that  an  alternative  equation  solver 
(e.g.,  preconditioned  conjugate  gradient)  might  be  a more  efficient  choice  for 
the  equation  solver. 
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A second  issue  concerns  post-shock  oscillations.  These  are  seen  in  Figure 
3.10,  which  is  the  radius  where  the  oscillations  were  found  to  be  the  most 
significant.  These  oscillations  were  not  seen  on  the  body  of  the  cylinder 
(Figure  3.9)  and  disappeared  rather  rapidly  away  from  the  cylinder  (Figure 
3.11).  This  is  clearly  a grid  effect  due  to  the  slight  misalignment  of  shock 
and  grid. 


4.  CONCLUSIONS 

Finite  difference  approximations  to  transonic  flow  problems  are  well- 
developed  and  have  been  successfully  used  for  a wide  range  of  problems. 

Nevertheless,  there  is  still  a need  to  develop  finite  element  approaches  for 
such  problems  for  a variety  of  applications.  We  feel  that  the  results 
presented  here  do  show  that  such  schemes  can  give  physically  meaningful 

simulations. 

On  the  other  hand,  our  experience  has  tended  to  indicate  that 

straightforward  application  of  the  basic  finite  element  idea  may  not  always  be 
successful.  Key  computational  issues  are  as  follows: 

(i)  There  is  a need  to  carefully  develop  the  spaces  in  which  the 

2 

approximations  are  formulated.  Classical  L spaces  are  generally 
inappropriate. 

(ii)  Some  form  of  diffusion  (via  density  modifications  or  otherwise)  appears 
to  be  needed.  Moreover,  care  is  needed  in  the  way  this  diffusion  is 
introduced. 

(iii)  The  geometrical  pattern  of  the  grid  selected  is  of  importance.  Some 
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patterns  are  definitely  superior  to  others. 

Finally,  there  are  some  important  "bottlenecks"  associated  with  the  scheme 
employed  in  this  paper,  which,  if  properly  addressed,  could  lead  to  an  even 
more  efficient  approach.  These  include  the  following: 

(i)  There  is  a need  for  an  equation  solver  that  is  more  efficient  than  the 
Newton  method  used  in  this  paper. 

(ii)  There  is  a need  for  adaptive  grid  refinement  techniques  that  would  lead 
to  a better  shock  grid  alignment  than  that  achieved  in  this  paper. 
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Figure  3.3.  Flow  pattern  for  the  free  stream  Mach  number 
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Figure  3.8.  Velocity  as  a function  of  angle:  (a)  on  cylinder,  (b) 

slightly  off  cylinder  — M = .51. 
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Figure  3.9. 


Velocity  as  a function  of  angle  on  the  cylinder  — full  least 
squares  scheme  with  density  modification  — M = .5. 
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Figure  3.10. 


Velocity  as  a function  of  angle  slightly  off  cylinder  — full 
least  squares  scheme  with  density  modification  — = .5. 
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Figure  3.11. 


Velocity  as  a function  of  angle  half  radius  above  cylinder 
full  least  squares  scheme  with  density  modification  — M = . 
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Figure  3*12.  Supersonic  bubble  — full  least  squares  scheme  with  density 

modification  — M = .5. 
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ABSTRACT 

Helmholtz  type  boundary  value  problems  are  important  in  a variety 
of  scattering  and  diffraction  problems.  Standard  numerical  schemes 
based  on  finite  difference,  finite  element,  or  integral  equation 
methods  are  generally  not  well  suited  for  these  problems  in  the 
"intermediate  frequency  range"  since  the  oscillatory  solution  is  not 
accurately  approximated  by  piecewise  polynomials.  In  this  paper,  a 
version  of  the  weak  element  method  is  employed  to  numerically  solve 
these  problems  in  two  dimensions.  This  method  consists  of  parti- 
tioning the  domain  into  small  "elements"  and  locally  approximating  the 
solution  in  each  element  by  a sum  of  exponentials.  These  piecewise 
approximations  are  joined  together  at  interelement  boundaries  by 
continuity  conditions  for  certain  functionals  of  the  approximate 
solution.  The  method  is  analyzed  using  a complementary  variational 
formulation.  It  is  shown  that  the  weak  element  method  is  considerably 
more  accurate  than  standard  discretization  methods  when  the  solution 
is  adequately  approximated  locally  by  the  exponential  basis 
functions.  These  results  are  validated  by  numerical  experiments. 

The  submitted  manuscript  has  been  authored  under  contract  DE- 
AC02-76CH000 16  with  the  U.  S.  Department  of  Energy.  Accordingly,  the 
U.  S.  Government  retains  a nonexclusive,  royalty-free  license  to 
publish  or  reproduce  the  published  form  of  this  contribution,  or  allow 
others  to  do  so,  for  U.  S.  Government  purposes. 
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1 . INTRODUCTION 


It  is  the  purpose  of  this  paper  to  analyze  and  numerically 

investigate  the  weak  element  method  applied  to  Helmholtz  type  boundary 

value  problems  in  multi-dimensional  domains.  Scalar  and  vector 

2 

Helmholtz  type  equations,  (A  + K n)u  = 0,  with  an  appropriate 
radiation  condition  and  spatially  dependent  index  of  refraction,  n, 
are  of  importance  in  a variety  of  stationary  wave  propagation  problems 
occurring  in  acoustics,  optics,  seismology,  and  electromagnetic 
theory.  Since  the  solution  is  rarely  known  in  closed  form,  it  is 
important  to  approximately  solve  these  problems  numerically  in  the 
intermediate  frequency  range,  where  asymptotic  methods  can  be 
unreliable. 

When  applying  typical  discretization  methods  such  as  finite 
difference  and  finite  element  methods  as  well  as  integral  equation 
methods,  one  is  faced  with  the  "resolution  problem".  This  means  that 
in  order  to  approximate  the  solution  accurately  when  the  wave  number, 
K,  is  not  small,  one  must  decrease  the  grid  size,  h,  and  hence  solve  a 
prohibitively  large  number  of  linear  equations.  This  problem  arises 
from  the  use,  in  the  usual  discretization  methods,  of  piecewise 
polynomial  functions  to  approximate  a highly  oscillatory  solution. 
Methods  for  overcoming  this  difficulty  have  been  developed  in  [1]  and 
[2]  by  combining  the  finite  element  method  with  functions  satisfying 
the  desired  oscillatory  behavior.  The  method  in  [1]  was  developed  for 
one-dimensional  problems.  The  method  in  [2]  was  designed  to  treat 
multi-dimensional  problems  for  which  most  of  the  propagation  occurs  in 
a narrow  angle  band  about  a fixed  direction. 
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An  alternative  approach  for  discretizing  boundary  value  problems 
is  given  by  the  weak  element  method  developed  in  [3].  This  method  is 
based  on  partitioning  the  domain  into  small  subdomains  (elements)  and 
approximating  the  solution  in  each  element  by  a solution  of  a 
localized  approximation  of  the  differential  equation.  These  piecewise 
approximations  are  joined  together  at  interelement  boundaries  by 
continuity  conditions  for  certain  functionals  of  the  approximate 
solution.  See  [4]  and  [5]  as  well  as  references  cited  there  for  a 
discussion  of  related  methods.  In  this  paper  we  consider  a version  of 
the  weak  element  method  in  which  the  approximate  solution  consists  of 
piecewise  exponential  basis  functions  joined  together  at  interelement 
boundaries  by  imposing  continuity  conditions  on  the  average  values  of 
the  approximate  solution  and  its  normal  derivative.  This  method  is 
described  briefly  in  Section  2 and  in  detail  in  [3]. 

In  Section  3 we  analyze  this  weak  element  method  for  a model 
problem  in  a rectangle.  The  analysis  employs  a complementary 
variational  principle  developed  in  [4]  in  connection  with  the  Laplace 
equation.  Here  we  extend  the  arguments  in  [4]  to  a non-self ad joint 
Helmholtz  boundary  value  problem.  We  prove  that  when  K2h  is 
sufficiently  small,  the  resulting  discrete  problem  is  well-posed  and 
the  mean-square  discretization  error  is  of  order  0(K3h2)  as  h-K).  This 
is  analogous  to  the  situation  for  standard  second  order  finite  element 
or  finite  difference  schemes.  We  also  show  that  when  the  phase  of  the 
solution  is  adequately  approximated  locally  by  the  exponential  basis 
functions,  the  weak  element  method  is  much  more  accurate  than  standard 
discretization  schemes  as  K increases.  This  is  the  main  advantage  of 
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the  weak  element  method.  Some  techniques  for  approximating  the  phase 
of  the  exact  solution  are  described  in  [1]  and  [2].  In  Section  4 we 
demonstrate  the  results  of  some  numerical  experiments  with  the  weak 
element  method.  We  summarize  our  conclusions  in  Section  5. 
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2.  THE  WEAK  ELEMENT  METHOD 


In  this  section  we  outline  briefly  the  weak  element  method 
described  in  [3].  We  employ  the  following  notational  convention. 
Suppose  that  ..  >an)  and  b denote  vectors  with  n components, 

and  denotes  an  nxn  matrix  whose  ith  column  is  and  whose  jth 

row  is  j • We  denote  the  inner  product  of  a and  b by  a*b  and  the  norm 
of  a by  la^Ca’a)'^  No  notational  distinction  is  made  between  row 
and  column  vectors.  Hence  a in  a$  is  a row  vector,  but  a in  $>a  is  a 
column  vector. 

We  consider  the  following  differential  operator  acting  in  a 

bounded  domain  D in  the  x=(x^,X2)  plane  with  a piecewise  smooth 

boundary,  3D.  Suppose  that  P and  A are  2x2  matrices  (P  being  positive 

definite  symmetric)  and  b and  q are  scalars.  Let  n denote  the  outward 

directed  unit  normal  to  D and  let  V=(3/.  ,3/  ) denote  the 

9xl  3x2 

gradient.  The  linear  elliptic  operator  L is  defined  in  D by 

L=-V*  PV+q , (2.1) 

and  the  boundary  operator  B is  defined  on  3D  by 

B=n*AV+b.  (2.2) 

Before  proceeding  further  we  require  the  following  additional 
notation.  Let  n^(D)  denote  a partition  of  D into  N elements 
(subdomains),  {tt^}.  We  use  ff^(7r),  j=l  ,2, . . . , JJ(tt)  , to  denote  one  of 
the  £(x)  smooth  sides  of  the  element  tt.  The  vector 

o(Tr)=(a1(ir),a2(Tr),.. . (it))  denotes  the  sides  of  it  oriented  in  a 

counterclockwise  manner  about  tt.  A side  a ^ ( 7r ) , which  is  incident  to 
another  subdomain,  it'’,  is  an  interior  side  and  is  denoted  by  o(tt,tt'’). 
Otherwise  0j(Tr)  lies  on  3D  and  is  denoted  by  o'(tt).  (See  Figure  1 for 
the  case  of  rectangular  elements.) 
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The  area  of  tt  (length  of  0^)  Is  denoted  by  | tt  | ( | a | ) . Let 

0(tt)  denote  the  smallest  angle  between  the  centroid,  x°,  and  any  two 

distinct  vertices  of  it.  In  order  for  the  resulting  system  of  linear 

equations  to  be  well  conditioned,  we  assume  that  0 ( tt ) >0  >0  for  each 

o 

element  irell^(D),  where  6q  is  independent  of  N. 

/s 

We  define  localizations,  L( tt ) and  L(ir),  of  the  operator  L given 
by  (2.1)  with  respect  to  the  element  it  as  follows: 

L(Tr)=-V(Po7)-(VPo)*V+qo  (2.3) 

and 

A 

L(tt)=-V(P  V)+q  (2.4) 

00 

where  Pq  denotes  P evaluated  at  x^ , etc.  Finally,  if  u(x)  is  a smooth 
(possibly  vector-valued)  function  and  a_.(ir)  is  an  arbitrary  side 
of  tt,  we  define 

u(0  (Tr))=|^-  ,<£  u(x)ds 
J 1 J 1 J (tt) 

and 

u(o(n))  = (u(a1(iT))  ,u(a2(i;)) , . . . ,u(  (it)  ) ) . 


o2(tt) 


02(O 


o3(tt) 


aj(ir-) 


Figure  1 
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We  are  now  ready  to  describe  the  weak  element  method  employed 
here  to  solve  the  boundary  value  problem  Lu=0  in  D,  Bu=g  on  3D.  For 
each  element  tt,  let  *Sj(x,tt),  (^(x,^)  * • * * tt)  denote  a linearly 

independent  set  of  solutions  of  the  localized  equation 

A 

L(tt)«51=0  (2.5) 

and  define 


<Kx,  (x, it),  <|)2(x, it),  ... 

Our  approximate  solution  on  tt  is  now  defined  by  the  equation 

w(x,Tr)=i(x,7T)*a(7r),  (2.6) 

where  the  coefficient  vector  a(ir)=(a.  (it)  ,a„(ir) , . . . ,a. . (it))  is 

J.  Z TT  J 

unknown. 


Now  suppose  that  (^(ct)  is  incident  to  tt'  at  the  side 
0.^(0  and  let  o'(tt)  be  a side  of  it  on  the  boundary  3D.  We  impose  the 
following  continuity  and  boundary  conditions  on  w(x,tt): 


w(°j  (lf) )=w(°j  ^(’T  ) ) * (2.7a) 

(nj*PVw)(aj(TT))=(nj*PVw)(ar(TTO)  (2.7b) 

on  interior  sides,  where  n^  is  the  outward  directed  unit  normal  to 
Oj(tt),  and 

(Bw)(a'(ir))=g(o'(Tr))  (2.7c) 

on  boundary  sides.  Substituting  (2.6)  into  (2.7),  we  obtain  a system 
of  linear  equations  for  the  N vectors 

a(ir^)-(aj  (irjL)  > a2  (tt^  ) , • . . (^^ ) ) > 1 > • • * )N. 
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It  is  shown  in  [3]  that  the  weak  element  approximation  given  by  (2.6) 
may  be  obtained  by  solving  an  equivalent  smaller  system  of  equations 
for  the  average  values  of  w on  all  sides,  0^(77),  of  the  partition. 
Remark  2.1:  As  described  in  [3],  the  weak  element  method  can  be 

generalized  as  follows.  In  (2.7),  we  impose  boundary  (continuity) 
conditions  on  the  average  value  of  the  function  (and  an  appropriate 
derivative)  for  each  boundary  (interior)  side,  0^(77).  To  generalize 
the  method  we  can  replace  the  average  value  on  0^(77)  by  a set  of 
linear  functionals  on  aj(7T)>  denoted  by  <Am(  a.  ( tt)  ) ,u>  ,m=l , . . .M.  We 
would  then  require  M£(tt)  local  basis  functions  in  each  element.  (For 
example,  these  linear  functionals  might  consist  of  the  average  value 
of  higher  order  moments  of  u on  each  side.)  This  could  lead  to  higher 
order  methods  than  the  method  discussed  in  this  paper  for  which  M=1 
and  <A1(o^.  (tt))  ,u>=u(a^  (tt))  . 

For  the  sake  of  simplicity,  in  the  remainder  of  the  paper  we 
consider  the  special  case  in  which  each  element  tteII^D)  is  a rectangle 
with  sides  parallel  to  the  x^ ,x^  coordinate  axes.  As  will  be  seen  in 
the  next  two  sections,  a key  to  the  success  of  the  weak  element  method 
lies  in  the  choice  of  the  local  basis  functions,  <t^(x , tt)  , i=l , . . . , 4 . 

We  employ  an  exponential  basis  defined  as  follows.  Let  x0=(x°,X2) 
denote  the  center  of  tt  and  define  the  unit  vectors 
e^=(l,0)  and  e2=(0,l)*  We  now  set 


p . (x  -x°)  _ p (x  -x°)  _ -p  (x  -x°) 

4>1(x,ir)=e  ,<J>2(x,TT)=e  L ,<j>3(x,ir)=e  , 


and  <f>^(x,TO=e 


-p2(x2_X2) 


/ (2.8) 
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where  p^^  and  p 2 are  chosen  so  that  each  ^(x.tt)  satisfies  (2.5).  A 


simple  calculation  yields 


(2.9) 


Basis  functions  analogous  to  those  given  by  (2.8)  and  (2.9)  can  be 
obtained  by  solving  the  equation  L(tt ) <^^^=0  instead  of  (2.5). 

The  basis  functions  in  (2.8)  can  be  generalized  as  follows. 
Define  the  unit  vectors  ela=(cosa,sina)  and  e2a=(sina, cosa) 

TT 

with  0<a<— . Now  set 
4 


, . Plaelo-(;-x0)  ^ , p2ae2a* 

<J.la(x,ir)=e  ,<(>2a(x,TT)=e  * 


^3a^x,1T^=e 


-p.  e.  .(x-x°) 
la  la  ' 


> and  4>A„(x>ir)°e 


-P2ae2a* (^-x°> 


The  constants  p^a  and  p2a  can  be  determined  as  before  by  substituting 
(2.10)  into  (2.5).  Note  that  a can  have  different  values  in  different 
elements.  This  can  be  useful  when  some  knowledge  is  available 


concerning  the  phase  of  the  exact  solution  (see  Remark  3.1  below). 

The  finite  difference  equations  obtained  using  basis  (2.8)  were 
derived  in  [3].  See  [6]  for  a detailed  investigation  of  the  result- 
ing finite  difference  formulas  using  both  (2.8)  and  (2.10)  and  for 
various  aspects  of  the  implementation  of  the  method.  The  resulting 
system  of  equations  may  then  be  solved  for  the  unknowns, 
aj ) > j =1 » • • • > i=l , . . . ,N.  The  weak  element  approximation,  w(x), 
is  obtained  from  (2.6).  Hence  we  obtain  w(x)  at  each  point  x in  D 
instead  of  only  at  nodal  points.  Observe  that  the  resulting  matrix  is 
highly  sparse.  Furthermore,  the  corresponding  large  system  of 
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equations  is  nonself adjoint  with  indefinite  symmetric  part  for 
problems  of  the  kind  considered  in  this  paper.  The  preconditioned 
iterative  method  developed  in  [7]  is  well  suited  for  solving  this 
system  of  equations.  This  iterative  solver  has  not  been  implemented 
in  connection  with  the  weak  element  method  at  the  present  time. 
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3.  ERROR  ANALYSIS 


In  this  section  we  consider,  for  the  sake  of  simplicity,  the 


following  model  problem: 

(a)  (-A-(K2+i6K))u=0  in  D, 


(b)  u=g  on  3D, 


| (3.1) 


J 


where  D is  the  unit  square,  6>0,  and  we  assume  that  the  solution 
2 

ueC  (D).  The  term  i6K  is  chosen  to  simulate  a radiation  condition  as 
in  [8],  Furthermore,  it  is  easily  seen  that  this  term  ensures  the 
well-posedness  of  (3.1).  We  set  K'=/K2+i6K  and  note  that  q=iK" 
in  (2.1)  and  b=l  in  (2.2).  Furthermore,  P(A)  is  the  2x2  identity 
(null)  matrix  in  (2.1)  ((2.2)). 

We  shall  employ  the  weak  element  method  described  in  Section  2 

with  local  basis  functions  given  by  (2.8)  and  (2.9).  Hence  we  have  a 

partition  of  D,H^=H^(D),  into  small  rectangular  elements, 

1T^>  i=l,.  «.,N,  such  that  the  local  basis  functions  defined  on  it  are 

i 

given  by 

+iK'(Xl-Xj)  +iK'(x2-xJ) 

e » e > (3.2) 

where  (x^.x^)  denotes  the  centroid  of  it  . Denote  the  lengths  of  the 
horizontal  and  vertical  sides  of  ir^  by  h^  and  l^L  respectively,  and 
define 


h=max 

7T . err 


N 


max(h 


i 

1 


(3.3) 


We  shall  analyze  the  discretization  error  using  a complementary 
variational  formulation  developed  in  [4]  for  the  Laplace  equation 
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Before  describing  the  variational  formulation,  we  introduce  some 

additional  notation.  For  a fixed  element 

ienN>  let  CTj  i = crjCTT±),j  = l,.  ..  ,4,  denote  the  four  sides  of  (see 

Figure  1 above)  and  set  o(o.  . )=1  (-1)  if  a.  . is  to  the  right  or  top 

J 9 1 J ,1 

of  (to  the  left  or  bottom  of)  tt#*  If  ct.  =o,^  , „ is  a common  side 

1 J y i J > 1 

of  tk  and  tt T j veR  (tt^)  flH  (^C),  and  v^(v^)  t^ie  restriction  of  v 
to  tt^ ( , we  define 

(a)  6v  =p(o.  )v  +p(a.^  ,^)vT  for  each  interior 
°4  4 J > 1 1 J t 1 1 

J > 

side  a.  . =a.„  and 
J .1  J ,i 


(b)  6v 


=p(o  )v  -p(o.  . )g  for  each  boundary 
j ,i  J’  1 J’1 

i dp  fl  / 


(3.4) 


side  a.  .. 

J ,i 

We  next  define  some  Sobolev  and  piecewise  Sobolev  spaces  that  are 
important  in  the  variational  formulation.  Suppose  B=D.  By  Hm(B),  we 
denote  the  space  of  functions  v such  that 


IMI2  = ,Z,.  | | d“v 
11  "m(B)  M<mM  1 


I ? 

L (B) 


<°°, 


where  m is  a non-negative  integer  and  D denotes  a derivative  of 
order  |a|.  Let  Hq(D)  denote  the  closure  of  Cq(D)  with  respect  to  the 


norm 


>||  111  • We  define 

IT(D) 


H ={veL2(D):  v 2 = E _ v.  2 <»}. 

1111  h it  . ell  11  i 1 1 m,  . 

Hm  l N H (tt  ) 


We  also  define 


2 N 2 
| V I h=  . E | V I 2 
Rm  i=l 


for  each  veHm  , 
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where  the  seminorm,  | | , is  defined  by 

Hm(iri) 

| v | 2 = E | | D^v | | 2 0 for  each  veHm(7r.), 

H1"^)  | a|=m  1 

Finally,  set 


i ^ i ^ 2 

={veH  : (-A-(k  +i6K) ) v^=0  for  each  i^ell  }. 


We  now  define  the  subspace  by 

K. 


E_  1 

H = veH^  *V  ^as  cont:*-riuous  normal 

derivatives  on  8n . for  each  ir.ell  - 
l l N 

Furthermore,  we  define  the  following  bilinear  form: 


h N _ 

Aj^Cv.w)^^ (Vv.Vw*-(K  +i6K)vw*)dx  ¥v,weH  , 


(3.5) 


* 

where  w denotes  the  complex  conjugate  of  w.  It  is  easily  seen  that 

the  solution,  u,  of  (3.1)  satisfies  the  following  variational  problem: 
E 

Find  ueH  such  that  ^ 


A^(u,v)=ru(v)  = E <£  g|^-ds  for  each  veH 


a . 

3, 


l (VP) 
i 


where  the  summation  is  taken  over  all  element  sides,  o'.  .,  contained 

3 

in  3D,  ds  denotes  arc  length,  and  — denotes  the  outward  directed 

dn 

normal  derivative  to  3D. 
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We  discretize  (VP)  by  defining  a finite  dimensional  subspace, 

b E h b 

S CH  , as  follows.  Let  A denote  the  functions,  ip  , defined  on  the 
K, 

element  sides  a.  . such  that  ip  is  some  constant,  c.  . on  a.  .•  For 

h h J>1  J’1 

,b  . Ah  h . 1 . c 3v  f v .h 

each  ip  m A , let  v in  H^.  satisfy  7^—  = p(a.  ^)ip  on  each 

a nj  3,1 

side  ot  , , where  — denotes  the  outward  directed  normal  derivative 
J.i 

J h 

to  a.  f = a.(7T.)  from  tt,#  Hence  v is  the  solution  of  a well-posed 
J,i  J 1 i 

Neumann  problem  in  each  element.  Let  S consist  of  all  such 

K 

h h h E 

functions  v • By  the  construction  of  S , we  see  that  S CH  . We  now 

K.  K 

formulate  our  discrete  variational  problem. 


h h 

Find  u £S  such  that 
K 

Ah,  h hN  „ f hN  „ h h 
A^(u  , v )=r  (v  ) Vv  eSK. 


\ 


) (DVP) 


/ 


Note  that  the  weak  element  and  finite  element  methods  are  based  on 
complementary  variational  principles  in  the  sense  that  essential 
boundary  or  interface  conditions  for  one  are  natural  conditions  for 
the  other. 

We  next  show  that  (DVP)  is  equivalent  to  the  weak  element  method 

h 

described  in  the  previous  section.  Suppose  that  u satisfies  (2.7) 
with  local  basis  functions  given  by  (3.2).  In  view  of  the  definitions 
of  P and  B corresponding  to  problem  (3.1),  it  follows  from  (2.7)  and 


(3.4)  that 


3u 

(a)  &>  6 (~ — ) ds=0  for  each  interior  side  a.  . 

o , . dn . _ 1,1 

i,i  j a.  , J 9 

J’  J J,i 


and 


(b)  & 6u  ds=0  for  each  side  a.  .. 

j,i  j,i  2,1 


\ 


> (3.6) 


/ 
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The  local  basis  coefficients,  aCi^  )=(a  ^ (7^)  , . . . , a^(ir^))  , are 

determined  in  each  u.  so  that  (3.6)  holds.  It  follows  from  (3.2)  that 
9uh  1 

is  constant  on  each  a..  Hence  it  follows  from  (3.2)  and  (3. 6) (a) 
nj  h h J 

that  u eS^.  Furthermore,  it  is  seen  from  (3.5),  (3.6)(b),  and 

integration  by  parts  that  (DVP)  holds.  Conversely,  it  is  easily  seen 

that  if  u satisfies  (DVP),  then  (3.6)  and  consequently  (2.7)  (with  w 

replaced  by  u^)  are  also  satisfied.  The  basis  coefficients  a(ir^)  may 

be  readily  obtained  as  in  [3]. 

It  thus  suffices  to  prove  that  (DVP)  is  well-posed  and  to 
ll  ll  tl 

estimate  the  error,  e =u-u  , where  u and  u satisfy  (VP)  and  (DVP), 
respectively.  To  this  end  we  first  state  the  following  result. 

Lemma  3.1;  Suppose  that  ip  satisfies  the  following  boundary  value 
problem: 

(-A-(K2-i 6K) )ip=z  in  D,  i|>=0  on  9D.  (3.7) 

Then 

I Ml  2 =CKMzM  2 ’ <3. 8) 

HZ(D)  l/(D) 

where  C is  independent  of  K and  z. 

Note  that  we  shall  often  use  the  same  letter  C to  denote 
different  constants  when  there  is  no  danger  of  confusion.  Lemma  3.1 
was  established  in  [8]  and  shows  how  the  norm  of  the  resolvent 
operator  for  (3.7)  depends  on  K.  This  Lemma  was  also  established  with 
the  Dirichlet  condition  replaced  by  a radiation  boundary  condition  on 
part  of  the  boundary.  In  such  cases  K is  replaced  by 

a with  0<a<l,  where  a depends  on  various  factors.  See  [8]  for  a 
more  complete  discussion  of  these  issues. 
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We  are  now  ready  to  prove  our  error  estimates.  We  first  prove 


the  following  Lemma  using  a duality  argument  as  in  [4], 

Lemma  3.2:  Suppose  that  u satisfies  (VP),  u satisfies  (DVP), 

and  e^u-u*1.  Then  there  exists  a constant,  C,  independent  of  K and  h 


such  that 


I eh | | 2 <CKh|eh|  . 
L vD)  Rlh 


Proof:  First  observe  that 

, |/  e z*dx| 

1 1 e 1 1 2 = sup~  TTiTl ' 

l/(D)  zee  (D)  11  Ml2(d) 


(3.9) 


1 00 

Let  4>eH0(D)fiC  (D)  denote  the  solution  of  (3.7).  For  each  vertical 

(horizontal)  element  side,  a.  .,  let  -r—  denote  » — (t — ).  It  follows 

J»!  on  dX^  dX^ 

from  (3.4),  (3.5),  (3.7),  and  integration  by  parts  that 


fDenz*dx=fDe^(-A-K'^)\p  dx 


si,j  K.  /v  .lf,is+4(eh'w' 

3»i  J.i 


Vi  E 1 11 

Since  e eH  CH  , it  follows  readily  using  (3.5)  and  integration  by 


parts  that 


Combining  this  with  (3.10),  we  deduce 


r h * - r h * 

/ e z dx=-J  & 6e  3^  ds. 


1>J  °J,i  °j,i  9n" 
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To  estimate  (3.11),  we  first  divide  each  rectangular  element, 
ni,  into  two  right  triangles  as  shown  in  Figure  1 

(with  n and  it  replaced  by  and  tt^).  For  simplicity,  consider  the 

triangle  containing  sides  a and  a.  . . Denote  this  triangle  by  t. 

i,i  4,1  i 

and  set  t • Set 


Y"  and  bM“  (3'I2) 

where  |o  | ,j=l  or  4,  denotes  the  length  of  side  o.  . . Define 
J J » i 


YVYiVYi^ 


and  note  that 


atj 

K ds= °»  J=1  or  4- 

J >i 


(3.13) 


Since  ueC(D),  it  follows  from  (3.6)(b)  that 
6 6e^  ds=0  for  each  side  a.  . . 

ai,i  °:.i  j>i 


(3.14) 


Using  (3.14)  and  a scaling  argument  as  in  [4],  it  may  be  seen  that 


3 l«e*  |2ds<Ch(|eh|  2 +|eh|2  ), 

j,i  j >i  H1(tri)  IT(irp 


(3.15) 


where  C is  independent  of  h and  (and  j = l in  Figure  1). 


3^  3*p 

Note  that  ^ — is  constant  on  o Combining  this  with 

(3.14),  we  obtain 
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6e  „ 
o . . a.  . 3n 
J.i  J.i 


h ’'A  , ,h  3*l 

— ds=®  o e 

' rt  rt 


a.  .a.  . 8n 
J.i  J »i 


^is. 


(3.16) 


It  follows  from  Schwarz'  inequality  and  (3.15)  that 


l*o.  ,6v  .liHaisll{v  ll  2.  . 

J.i  J.i  j,i  L (a  ) 


3*/ 


> (3.17) 


1/  . , . 
<Ch/2(|eh|  +|.h| 

h‘(i..)  h‘(.J)  Jn 


Using  an  argument  in  [4]  (see  Lemma  2.2.6)  and  (3.13),  we  deduce 


1/ 

2 <<*'2\\\  2 

LZ(0j>;.)  1 H2(t.) 


<Chly,2 


I 2 

H (tt±) 


(3.18) 


Combining  (3.8)  with  (3. 16)-( 3. 18)  and  the  Schwarz  inequality,  we 
conclude  that 


l£,j  K.  .6e0.  .|^sl<CKh||z| 
J.l  J>1 


I 2 I e I h* 
l/(D)  Rlh 


(3.19) 


Finally,  we  combine  (3.9),  (3.11)  and  (3.19)  to  complete  the  proof. 
Q. E. D. 

We  now  prove  our  main  result. 

2 

Theorem  3.1:  Suppose  that  u satisfies  (3.1)  and  ueC  (D).  Then  for 


K h sufficiently  small, 


(3.20) 


there  exists  a unique  solution  u satisfying  (DVP)  and 
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(3.21) 


| eh | 2 h = C ^ h (|u-vh|2  h+K2| |u-vh| |2  ). 

'..i  v eST,  h1  LZ(D) 


H 


K 


Furthermore , 


Kh|eh|  h+||eh||  2 <CKh2||u||  . 

Hlh  LZ(D)_  W>) 


(3.22) 


The  constant,  C,  in  (3.21)  and  (3.22)  is  independent  of  K,  h,  and  u. 
Proof:  First,  assume  that  uh  exists.  Using  (3.5),  we  immediately 

obtain 

o O U 0 

" , (3.23) 


I eh  | 2 <|Aj(eh,eh)|  + |(KZ+i6K)M|en|r2  ■ 

1h  " * lz(d) 

rl 


Employing  Lemma  3.2  and  condition  (3.20),  we  may  "kickback"  the  last 
term  in  (3.23)  to  obtain 


hi  2 


<c|A^(eh,eh)| 


H 


1‘ 


In  view  of  (VP)  and  (DVP),  the  last  estimate  yields 


| e^1 1 2 ^ <C|  A^(e\u-v^)  | for  each  v*1  in  S^. 
H1 


(3.24) 


It  follows  readily  using  (3.5)  that 


|Ajl(eh,u-vh)|<|eh|  Ju-vh|  +(K2+i6K)  | | eh | | 2 ||u-vh|| 

* " Rlh  Hlh  LZ(D)  LZ(D) 


<U(|eh|2  h+K2|  |eh|  |2  KC  (|u-vh|2  h+K2|  |u-vh|  |; ) 

Hlh  LZ(D)  71  LZ(D) 
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for  arbitrarily  small  n>0.  Combining  this  estimate  with  (3.24)  and 
again  applying  Lemma  3.2,  (3.20),  and  a "kickback"  argument,  we  obtain 
(3.21). 

We  next  prove  (3.22).  In  view  of  (3.24),  we  construct  a 

function  v*1  in  sjj  satisfying 
K. 

|A^(eh,u-vh)|<C  h2| |u| |2  +p|eh|2  (3.25) 

W2(D)  Rlh 

for  ri>0  arbitrarily  small.  Suppose  in  JI^  has  sides 

. ,j=l,... ,4.  We  define  four  constants  c,  .,...,c,  such  that 

J>i  l,x  4,i 


K.  .cj,ids=  K.  .p(aj,i)l^ls»j=1’*“’4> 

J > 1 3 > i i 


(3.26) 


where  n^  is  the  outward  directed  unit  normal  to  9tt^ . It  is  easily 
seen  using  the  Mean  Value  Theorem  for  integrals  that  for  each 
j=l , . . . ,4 , we  have 


c.  . =p(o. 

J»i  3" 


>^,i> 


for  some  point  P.  ,ea . ..  It  follows  readily  using  the  Mean  Value 
J y 1 J > 1 

Theorem  for  derivatives  that 


J J1  J > 1 

j = l,...,4.  We  now  define  ^ in  Ah  by  i|)h|  =c . 


ds )%  (3.27) 


) 


for  each  side  a. 

3 ,i 
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It  follows  immediately  from  (3.26)  and  (3.27)  that  for  each  a.  , we  have 

J » i 


9u,  Vi 

(a)  $a  ~fa~ds~$,a  PC o . .)i|>  ds  and 

j,i  i j >i  J’ 


3u 


3 

2, 


(b)  ||^--p(a  )t"||2  <Chz ! | u | | , 

i h (a  ) W>) 

J.i 


(3.28) 


To  construct  v satisfying  (3.25),  we  solve  the  Neumann  problem  for 

equation  (3.1)(a)  in  each  77 ^ with  Neumann  data  given  by 

p ( a . # on  each  side  a.  j«l,...,4*  We  denote  the  solution  by 
J j 1 J y 1 

h h h 

v . Set  v =v,  in  each  tt.  and  note  that  v eS  . We  now  employ  integration 
i 11  K 

by  parts  to  deduce 

h,  h h.  „ r h 8 , h * OQv 

A^(e  ,u-v  )=£  j> 97r  e -^-(u-v  ) ds.  (3.29) 

i i h 

It  follows  immediately  from  the  construction  of  v and  (3. 28) (a)  that 

k 


'dn  1 (u-v^)  ds=0  for  each  n^. 

i "i  h 1 

In  view  of  this,  we  may  replace  e on  the  right  hand  side  of  (3.29)  by 


h h i , 
e^=e  -c  in 


(3. 30) (a) 


with  c^3i * — e^ds,  so  that 

I S^l  ^i 


e^ds=0  for  each  ir^.  (3.30)(b) 

Applying  (3.28)(b),  (3.29)  (with  eh  replaced  by  e^  on  the  right  side),  and 
the  Schwarz  inequality,  we  obtain 
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|A^(eh,u-vh)|<1S1  ||ej| 


2,. 


_ 4./  V Ull, 

L(a.il)  i 


(3.31) 


<Ch2|  |u||  t Mejll 

W (D)  1 L^Ott  . ) 

1 


To  estimate  | |e.  | | , we  map  tt  . onto  the  unit  square,  tt, 

1 l/(3ir.)  1 

1 


and  employ  the  following  well-known  estimate: 

I I w I I 2 1 =C^IW|21  I ^ 87TWcis  I for  ea°h  w in  H1(tt). 

H (tt)  H (tt)  c7T 


As  in  [4],  we  combine  this  estimate  with  (3.30)(b)  and  map  tt  back  onto 
tt  . to  obtain 

l 


I ei  I I 2 <ChV2|eJ|  -Ch/2|eh|  , 

L ( 3tt± ) 1 H (tt± ) IT(ir  ) 


(3.32) 


using  (3.30)(a)  in  the  last  step.  We  now  combine  (3.31)  and  (3.32)  to 
conclude  that  for  arbitrarily  small  ri>0: 


|Aj)(e\u-vh)|<Ch2  Z | |u||  |eh| 

WZ  (D)  H^tt.) 


(3.33) 


< ill  (Cnh4Mull22  +^eh|2i  >' 

^ WZ(D)  H^tt.) 


Estimate  (3.25)  now  follows  from  (3.33)  since  N=0(h  ).  Combining  (3.24) 

and  (3.25)  with  n>0  sufficiently  small,  we  deduce 


|en|  h<Ch||u||  . 

H1 


(3.34) 
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Estimate  (3.22)  now  follows  from  (3.34)  and  Lemma  3.2,  Finally,  to  prove 
that  (DVP)  is  well-posed,  it  suffices  to  prove  uniqueness  since 
SK  is  finite  dimensional.  If  g=0  in  D,  then  u=0  in  D since  (3.1)  is  well- 
posed.  Hence  it  follows  from  (3.22)  that  u^=0  in  D and  we  have  proved  that 
(DVP)  has  a unique  solution.  Q.E.D. 

Remark  3.1:  Typically,  the  solution  of  (3.1)  satisfies  ||u||  =0(K2). 

w2(d) 

oo v ' 

■l  no 

Hence  it  follows  from  (3.22)  that  | |e  | | =0(K  h ).  This  is  analogous 

l (d) 

to  results  obtained  for  standard  second  order  finite  element  or  finite 

difference  schemes  (see  [8]).  However,  it  follows  from  (3.21)  that  the 

weak  element  method  is  clearly  superior  for  moderate  to  large  values  of  K 

when  the  oscillatory  behavior  of  the  solution  is  well  approximated  in  each 

element  by  the  local  basis  functions,  assuming  the  "stability"  constraint 

(3.20)  holds.  This  constraint  also  occurs  in  connection  with  standard 

discretization  schemes.  Our  numerical  results  indicate  that  this  stability 

constraint  does  not  cause  serious  computational  problems  for  the  weak 

element  method  when  the  oscillatory  behavior  of  the  solution  is  well- 

tl 

approximated  by  functions  in  s".  We  shall  see  in  Section  4 that  in  such 

K. 

cases  the  discretization  error  is  quite  small  even  when  K^h  is  large. 

Remark  3.2:  It  follows  from  the  previous  remark  that  the  main 

computational  advantage  of  the  weak  element  method  occurs  when  the 
oscillatory  behavior  of  the  solution  is  well-approximated  by  functions 
in  SR.  The  determination  of  this  oscillatory  behavior  can  be  difficult  for 
realistic  physical  models.  This  question  was  investigated  in  [1]  and 
[2].  In  [1],  asymptotic  methods  were  employed  in  connection  with  a one- 
dimensional scattering  problem.  In  [2],  multi-dimensional  models  were 
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treated  for  which  it  is  known  that  most  of  the  propagation  occurs  in  a 
narrow  angle  band  about  a fixed  direction.  This  condition  is  closely 
related  to  the  "paraxial  approximation"  and  holds  in  a variety  of 
application  areas. 

Remark  3.3:  The  weak  element  method  described  in  Section  2 can  be  extended 

in  various  ways  (see  Remark  2.1).  Alternatively,  the  variational 
formulation  described  in  this  section  can  be  generalized  by  employing 
higher  order  approximating  subspaces  SR.  See  [4]  for  a detailed  discussion 
of  this  in  connection  with  the  Laplace  equation.  Furthermore,  more  general 
boundary  value  problems  can  be  treated  than  (3.1).  This  includes  more 
general  domains,  variable  coefficients,  and  radiation  boundary 
conditions.  We  intend  to  investigate  some  of  these  questions  in  the 
future. 
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4.  NUMERICAL  ANALYSIS 


In  this  section,  we  demonstrate  the  effectiveness  of  the  weak 
element  method  described  in  Section  1 for  simple  two-dimensional  test 
problems  whose  solutions  are  known  in  closed  form.  Our  measure  of 
error  is  given  by 


u-u 


E2  E 


*2(d) 


HU2(d) 


h 

where  u(u  ) is  the  exact  (approximate)  solution,  | | | | denotes 

JTCD) 

the  discrete  mean-square  norm,  and  D is  a rectangle  in  either 
Cartesian  or  polar  coordinates.  D is  partitioned  into  rectangular 
elements  as  described  in  Section  2 such  that  the  grid  points  are 
equally  spaced  in  each  direction.  We  denote  the  number  of  intervals 
in  the  x1  and  directions  by  Nj  and  N , respectively.  The 
differential  operator  is  given  by  (3.1) (a)  with  6=0. 

Our  boundary  condition  for  the  first  two  examples  is  the 
Dirichlet  condition,  (3. l)(b) , although  we  have  obtained  analogous 
results  for  various  combinations  of  Dirichlet,  Neumann,  and  impedance 
boundary  conditions.  In  Example  3,  we  consider  the  Helmholtz  equation 
in  polar  coordinates  in  the  exterior  of  the  unit  circle  with  a 
radiation  boundary  condition  imposed  on  an  artificial  outer 
boundary.  Our  main  purpose  in  all  of  these  examples  is  to  evaluate 
the  discretization  error  for  different  values  of  , N2,  and  K.  The 
calculations  were  performed  on  a CDC  7600  at  Brookhaven  National 
Laboratory.  The  system  of  equations  were  solved  using  a standard 
conjugate  gradient  iterative  method  applied  to  the  normal  equations  as 
well  as  a direct  solver  based  on  Gaussian  elimination.  Both  methods 
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resulted  in  essentially  the  same  discretization  errors.  It  is 
expected  that  more  recently  developed  preconditioned  iterative 
methods,  such  as  that  discussed  in  [7],  would  be  considerably  more 
efficient. 

Example  1 : For  our  first  series  of  numerical  experiments,  we  assume 

that  D is  the  unit  square  and  choose  Dirichlet  boundary  conditions 

such  that  the  solution  is  given  by 

u(x1,x2)  = sin  K(x^cos  a + x2  sin  a),  0 < a < — . (4.1) 

We  employ  the  weak  element  method  with  local  basis  functions  given  by 

(3.2)  (with  Kf=K).  Our  results  are  demonstrated  in  Tables  1A  - D with 

Ni  - N2  = N = h 1.  We  have  also  employed  the  five-point  finite 

difference  scheme  in  this  and  the  following  example  although  it  is  not 

necessary  to  demonstrate  the  results  obtained  using  this  scheme.  It 

suffices  to  observe  that,  as  expected,  this  finite  difference  scheme 

o 

has  convergence  rate  0(h)  as  hlO  with  K fixed.  (Our  numerical 
results  indicate  that  this  is  also  the  case  for  the  weak  element 
method.)  Furthermore,  the  five-point  scheme  is  not  accurate  when 
Kh  = | > 1. 

It  is  readily  seen  from  (3.2)  and  (4.1)  that  when  a=0  or  a=j,  the 
solution  in  each  element  may  be  expressed  as  a linear  combination  of 
local  basis  functions.  Hence  we  would  expect  the  weak  element 
approximation  to  yield  the  exact  solution,  except  for  accumulated 
roundoff  errors.  This  is  validated  in  Table  1A  for  N=4  and  various 
values  of  K.  On  the  other  hand,  it  follows  from  (3.2),  (3.21),  and 
(4.1)  that  the  more  a differs  from  0 and  y , the  less  effective  the 
weak  element  method  should  be  with  this  basis  (see  Remark  3.1).  In 
Tables  IB  - D,  we  consider  various  values  of  N and  K with 
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IT  71  TT 

ot=y-^,  — , and  — , respectively.  We  see  from  Table  IB  that  when 
u 

cx  = the  wea^  element  method  is  accurate  even  when  Kh  = 16.  From 

Ti- 

Table  1C  we  see  that  for  a = — , the  method  yields  accurate  results 
when  Kh  = 2 and  hence  is  more  effective  than  the  five-point  finite 
difference  scheme.  On  the  other  hand,  when  a = we  see  from  Table  ID 
that  the  method  does  not  yield  accurate  results  when  Kh  > 1.  We  have 
observed  that  in  cases  such  as  this  for  which  the  phase  of  the 
solution  is  not  sufficiently  well  approximated,  there  is  no  advantage 
in  using  the  weak  element  method  instead  of  a standard  discretization 
scheme. 
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Table  1C 


Table  ID 


Example  2:  For  our  next  class  of  problems,  we  consider  solutions  of 


the  form 

/f~2  2 

l . ^ L K -L  x2,L=1,2 

with  Dirichlet  boundary  conditions  on  the  unit  square. 


(4.2) 
Our  local 


basis  for  the  weak  element  method  is  again  given  by  (3.2).  For  Kh«l, 
the  convergence  rate  is  again  0(h  ).  However,  for  Kh»l,  the  weak 
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element  method  behaves  differently  for  this  problem  than  for  the 
previous  example.  The  reason  for  this  is  that  the  x^  dependence  of  u 
is  independent  of  K.  Suppose  that  K»L  in  (4.2),  so  that 
u(Xj  ,X2)/vsinLx^cosKx2.  Hence  the  X2“dependence  of  u may  be  reproduced 
almost  exactly  by  the  basis  functions  for  K large  and  the  accuracy 
will  be  almost  independent  of  the  number  of  X2  grid  points.  We  are 
thus  left  with  approximating  sinLx^  by  constants,  yielding  an 
O(N^)  order  approximation  to  u that  is  independent  of  K for  large 
K.  We  illustrate  typical  results  in  Table  2 for  L=l,  K=128,  and 
various  values  of  and  * 


Table  2 
(L=l , K-128) 


N1 

n2 

E2 

8 

1 

9.5  x 10-2 

16 

1 

4.9  x 10-2 

32 

1 

2.5  x 10~2 

64 

1 

1.3  x 10-2 

128 

1 

6.4  x 10-3 

8 

2 

9.3  x 10“  2 

16 

2 

4.8  x 10_2 

32 

2 

2.4  x 10"2 

64 

2 

1.2  x 10"2 

128 

2 

6.1  x 10"3 

8 

4 

9.3  x 10”  2 

16 

4 

4.8  x 10”  2 

32 

4 

2.4  x 10"2 

64 

4 

1.2  x 10"2 

128 

4 

5.8  x 10-3 

i 
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Example  3 : The  final  problem  we  consider  is  one  treated  in  [9]  using 

an  integral  equation  approach  combined  with  a finite  difference  method 

for  K not  too  large.  Introducing  polar  coordinates,  (r,6),  the 

2 

problem  consists  of  solving  the  Helmholtz  equation,  (A+K  )u=0  in  the 

exterior  of  the  unit  circle,  subject  to  the  boundary  conditions 

2 

u(x)=x  on  r=l  and  the  radiation  condition 

( 3u/3r)-iKu=o(r  (4. 3) 

for  r large.  The  solution  of  this  problem  is  given  by 


X2  H^^dCr)  sinOH  ^(Kr) 

u(x) V „ (!),„,  = TUT,  : 


(4.4) 


H 


1 


(K) 


H 


1 


’(K) 


(1) 


where  denotes  the  Hankel  function  of  first  kind  and  order  1. 

In  order  to  determine  the  discretization  error  due  to  applying 
the  weak  element  method  to  this  problem,  we  replace  the  right-hand 
side  of  the  radiation  condition  (4.3)  by  the  exact  value  obtained  by 
applying  (3/3r)-iK  to  (4.4)  on  a circle  of  radius  R>1  and  denote  this 
function  by  g(R,0).  Employing  polar  coordinates  and  appropriate 
symmetry  conditions  on  u(x),  we  obtain  the  following  boundary  value 
problem  for  u(x) : 

!r^rfr)+r  ^ +k2™=°  in  D’ 
u=sin0  on  r=l, 

(3u/3r)-iKu=g(R,0)  on  r=R, 

(3u/30)=O  on  0= tt/2  and 
u=0  on  0=0, 
where  D denotes  the  domain  l<r<R,  O<0<tt/2. 

Problem  (4.5)  may  be  placed  in  the  general  framework  of  (2.1)  by 
replacing  the  (x^,x^)  coordinates  by  (r,0)  coordinates,  so  that 


(4.5) 


V=(3/3r, 3/30).  Hence  the  domain  D is  a rectangle,  the  matrix  P is 


given  by 


P=(r  ° ) 

* ^0  1/r'  * 


and  q=-K  r in  (2.1).  Since  ( VP)q=( 1 ,0)/0 , we  simplify  the  problem  by 
making  the  transformation 

u(r,0)=r  1/,2v(r,0). 

We  now  obtain  the  following  boundary  value  problem  for  v: 

(_V  v+q ^ ) v=0  in  D,  ^ 

v=sin0  on  r=l , 

- (iK+|^)v=R1/2g  on  r=R,  / (4.6) 

■|^=0  on  0=j  and 


v=0  on  0=0, 


where 


P'=(X  ° ) and  q'=-{K2+-~r). 

Or  4r 


Hence  (VP")  =0.  Using  (4.4),  we  see  that  the  solution  is  given  by 


r1/2sin0H. (1)(Kr) 

v(r,0)=r1/2u(r,0)= . 

H U;(K) 


(4.7) 


We  now  apply  (2.8)  and  (2.9)  to  obtain  the  following  local  basis 
functions: 

e±i(K2+l/4ro2)1/2(r-ro),  e±irQ(K2+l/4ro2) 1/2( 0-0q) . (4.8) 


We  also  note  the  following  asymptotic  representation  of  H^v  '(Kr) 
(see  [10]): 
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(4.9) 


H](1)(Kr):(4-)1/2e1(Kr‘3,,/4) 

1 *Kr  j-0 


(-i)jr(j+|) 

j!(2iKr)jr(-j+|) 


for  Kr  large  and  J>1,  where  T denotes  the  gamma  function. 

If  we  compare  (4.7)-(4.9),  we  see  that  the  r-dependence  of 

v(r,0)  is  accurately  reproduced  by  the  basis  functions  for  Kr  large. 

This  is  analogous  to  the  situation  in  Example  2.  In  Table  3A,  we 

examine  the  error,  E2,  for  R=2  and  different  values  of  K and  N=N1=N2, 

where  Nj(N2)  is  the  number  of  subintervals  in  the  r(0)  direction.  For 
£ 

— small,  the  weak  element  method  again  behaves  analogously  to  the 

five-point  finite  difference  scheme.  On  the  other 'hand,  E2  is  nearly 

constant  for  — large  and  N fixed  as  K increases. 

Furthermore,  we  have  observed  that  for  larger  values  of  R the 

K K 

errors  are  about  the  same  as  for  R=2  when  — is  large.  When  — is 
small,  accuracy  is  destroyed  by  the  coarse  grid  in  the  r-direction. 
This  can  be  remedied  by  using  a graded  mesh  in  which  the  r-grid  sizes 
are  systematically  increased  as  r increases  (see  [11]).  We  illustrate 
the  high  frequency  behavior  in  Table  3B,  where  K=R=128.  In  this  case 
the  grid  sizes  in  the  r-direction  are  quite  large.  We  observe  that  E2 
is  essentially  constant  when  N2  (the  number  of  intervals  in 
the  0-direction)  is  fixed  and  varies.  Furthermore,  the  error  with 
respect  to  0 is  of  order  0(N2  1).  The  explanation  of  these  numerical 
results  is  the  same  as  that  given  in  example  2 (i.e.,  the  0-dependence 
of  v(r,0)  is  approximated  locally  by  constants). 
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5.  CONCLUSIONS  AND  COMMENTS 


We  have  analyzed  and  tested  a version  of  the  weak  element  method 
developed  in  [3]  in  connection  with  the  Helmholtz  equation. 
Mathematical  models  of  this  kind  occur  in  various  scattering  and 
diffraction  problems.  Standard  discretization  schemes  based  on  finite 
difference,  finite  element,  or  integral  equation  methods  are  not  well 
suited  for  these  problems  when  the  wave  number,  K,  is  not  small,  since 
piecewise  polynomials  are  not  good  approximations  to  the  oscillatory 
solution.  On  the  other  hand,  the  weak  element  method  is  based  on 
piecewise  exponentials  that  satisfy  a localized  approximation  to  the 
differential  equation. 

We  have  proved  that  the  particular  weak  element  method  outlined 
in  Section  2 with  mesh  size  h has  a convergence  rate  of  order  0(h)  as 
h-K)  for  fixed  K.  Our  analytic  results  also  indicate  that  the  method 
yields  a good  approximation  to  the  solution,  u,  when  the  oscillatory 
behavior  of  u is  well  approximated  by  the  local  basis  functions.  The 
proof  is  based  on  a complementary  variational  principle  developed  in 
[4]  in  connection  with  the  Laplace  equation.  It  is  expected  that  this 
proof  can  be  extended  to  more  general  boundary  value  problems  and 
higher  order  weak  element  methods. 

We  have  also  validated  our  theoretical  results  with  respect  to 
test  problems  for  which  the  solution  is  known  in  closed  form.  We  have 
seen  from  these  examples  that  the  weak  element  method  offers  no 
advantage  in  general  compared  to  the  five-point  finite  difference 
scheme.  However,  our  theoretical  and  numerical  results  demonstrate 
that  the  weak  element  is  considerably  superior  for  moderate  to  large  K 
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when  the  oscillatory  behavior  of  the  solution  is  adequately 
approximated  locally.  For  general  variable  coefficient  problems,  this 
oscillatory  behavior  will  vary  in  different  parts  of  the  domain. 
Consequently,  an  important  practical  area  of  investigation  is  the 
development  of  methods  for  determining  locally  the  approximate 
oscillatory  behavior  of  the  solution  for  large  K.  This  was  done  in 
[1]  in  connection  with  a one-dimensional  scattering  problem  using 
asymptotic  methods.  This  was  also  done  in  [2]  for  multi-dimensional 
propagation  models  for  which  most  of  the  propagation  occurs  in  a 
narrow  angle  band  about  a fixed  direction.  The  use  of  error 
estimators  and  adaptive  discretization  methods  might  also  be  useful  in 
determining  appropriate  local  basis  functions. 
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ABSTRACT 

A methodology  is  established  to  cluster  points  along  curves  in  a 
manner  which  does  not  change  the  existing  pointwise  distribution  out- 
side of  a specified  region  containing  the  cluster.  In  each  instance, 
points  are  pulled  from  the  perimeters  of  the  region  towards  the  clus- 
ter center.  The  result  is  a smooth  expansion  from  each  end  followed  by 
a compression  into  the  center.  Altogether,  this  represents  a local 
redistribution  of  points  which  can  be  employed  either  interactively  or 
automatically. 
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INTRODUCTION 


When  a pointwise  distribution  along  a curve  is  acceptable  every- 
where except  in  certain  local  regions,  the  capability  to  redistribute 
points  only  in  those  regions  becomes  important.  Our  objective,  here, 
is  to  create  a framework  from  which  methods  for  the  desired  local  re- 
distribution can  be  developed.  This  is  done  by  forming  elementary 
operations  which  are  then  applied  in  succession.  Each  operation 
smoothly  forms  a single  cluster  about  a point  by  attracting  only  near- 
by points:  the  pointwise  locations  beyond  a specified  distance  on 

either  side  remain  unchanged.  As  such,  the  action  occurs  in  an  in- 
terval where  both  the  endpoints  and  the  internal  cluster  point  remain 
fixed:  the  other  points  move  in  from  each  side  while  maintaining  a 

globally  smooth  variation  in  pointwise  spacing. 

Upon  application,  a new  pointwise  distribution  is  created  from  an 
old  one  and  differs  from  it  only  in  the  chosen  interval.  The  old  or 
"previous"  distribution  is  always  viewed  as  a mapping  from  a uniformly 
distributed  independent  variable  to  the  curve.  This  variable  is  often 
referred  to  as  just  the  existing  parameterization  for  the  curve.  With 
a finite  number  of  points,  a uniformly  spaced  grid  along  the  paramet- 
ric interval  is  mapped  onto  a grid  along  the  curve.  The  new  distribu- 
tion is  simply  constructed  by  composition  whereby  we  first  map  a new 
uniformly  distributed  parametric  interval  onto  the  old  one  and  then 
apply  the  old  map  to  the  result.  In  terms  of  grids,  a uniformly 
spaced  grid  on  the  new  parametric  interval  is  mapped  onto  the  old  par- 
ametric interval  to  produce  a distribution  there  which  is  non-uniform 
in  some  local  subinterval.  On  that  subinterval,  the  application  of 
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the  old  mapping  is  accomplished  with  the  aid  of  local  interpolation. 
Off  of  that  subinterval,  the  points  coincide  with  the  old  parametric 
locations  and  no  interpolation  is  required. 

At  each  stage,  there  is  a progression  from  old  to  new  correspond- 
ing to  the  application  of  an  elementary  operation.  As  noted  above, 
each  such  operation  can  be  generated  from  a local  reparameterization 
which  is  just  a mapping  between  old  and  new  parameters.  The  actual 
construction  can  be  done  in  either  forward  or  backward  directions  by 
the  use  of  weight  functions.  The  forward  direction  is  from  new  to  old 
while  the  backward  is  from  old  to  new. 

WEIGHTS  AND  TRANSFORMATIONS 

With  the  view  of  larger  masses  pulling  more  strongly  to  a center 
of  gravity,  weights  are  most  commonly  thought  of  as  being  more  strong- 
ly attractive  when  they  are  large.  For  the  application  to  distribu- 
tion functions,  this  view  means  that  points  are  more  strongly  at- 
tracted to  locations  of  large  weight.  In  correspondence,  the  point- 
wise  spacing  must  then  shrink  to  adjust  to  a large  weight.  The  most 
simple  way  to  have  this  happen  is  to  make  the  spacing  vary  in  an  in- 
verse proportion  to  the  weight.  In  terms  of  changing  the  spacing  in 
the  old  parametric  interval,  we  must  then  force  the  product  of  the 
weight  and  the  desired  spacings  to  be  equal  to  a constant.  When  the 
same  interval  is  taken  for  the  old  and  new  parameters,  that  constant 
is  just  the  increment  from  a uniform  spacing.  In  our  development,  we 
will  always  let  s denote  the  old  parameter  and  t denote  the  new  param- 
eter. In  this  notation  and  with  our  assumption  of  the  same  interval, 
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the  forced  condition  is  given  by 


dt  = w ds  (l) 

and  is  called  an  equi distribution  of  the  weight  w since  equal  amounts 
of  weight  must  appear  in  each  interval  ds.  For  grids,  the  total 
weight  between  every  pair  of  points  is  then  the  same. 

To  construct  the  elementary  operations  of  local  clustering,  we 
recall  the  basic  requirements  that  the  cluster  center  and  the  interval 
endpoints  must  remain  unchanged.  For  the  weights,  these  requirements 
become  integral  statements;  namely,  that  the  integrals  of  wds  and  ds 
are  the  same  over  both  of  the  intervals  from  the  cluster  center  to  the 
endpoints  of  the  cluster  region.  Noting  that  uniform  spacing  would 
occur  if  w = 1 , deviations  therefrom  are  responsible  for  non-uniform 
spacing  and  can  be  represented  as  a function  f which  is  added  to  the 
unity  of  uniformity  in  the  weight  to  get  w = 1+f.  In  terms  of  f,  the 
preservation  of  cluster  center  and  endpoints  results  when  the  integral 
of  f vanishes  over  each  of  the  two  intervals  above.  To  define  these 
intervals  in  a clear  way,  a zero  subscript  will  be  employed  for  the 
center  while  a minus  and  a plus  will  be  used  as  subscripts  to  indicate 
the  endpoints  in  negative  and  positive  directions,  respectively,  from 
the  center.  In  this  notation,  the  preservation  condition  means  that 
new  t and  old  s must  satisfy  t_  = s_,  tg  = Sg,  and  t+  = s+  or  that  the 
function  f which  gives  variations  from  uniformity  must  integrate  to 
zero  both  from  s_  to  Sq  and  from  Sq  to  s+. 

To  obtain  a maximal  amount  of  control  over  shape,  such  a function 
is  best  created  from  a piecewise  polynomial  construction.  The  sim- 
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plest  of  these  constructions  is  accomplished  with  two  adjoining  line 
segments  for  each  of  the  two  intervals.  This  is  depicted  in  Fig.  1 
where  it  is  clearly  evident  that  the  first  segment  from  either  end 


f 


Figure  1 : The  function  which  must  be 

added  to  unity  to  form  a weight  for  Eq.  1 

must  lie  below  the  axis  to  create  negative  areas  which  are  enough  to 
balance  the  positive  area  caused  the  linear  rise  to  the  positive  value 
at  the  center.  The  center  value  determines  the  intensity  of  the  clus- 
tering: the  sum  with  unity  gives  the  weight  wQ,  and  hence,  the  min- 

imum spacing  dt/wQ.  In  compensation  for  the  smallest  spacing  at  the 
center,  the  spacing  must  first  increase  and  then  start  to  decrease. 
This  starts  from  each  endpoint  spacing  and  linearly  increases  to  a 
maximum  at  the  end  of  each  segment  below  the  axis.  Upon  forming  the 
weight  w with  an  addition  to  unity,  the  maximum  spacing  on  each  side 
is  given  by  dt/w  with  w evaluated  at  each  corresponding  end.  Aside 
from  the  obvious  limitation  on  the  maximum  spacing  imposed  by  the 
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total  interval  length,  there  is  the  basic  limitation  that  the  weight  w 
must  be  positive:  negative  weights  flip  the  incremental  intervals; 

thereby,  causing  a singularity  in  the  mapping  and  a folded  grid.  As  a 
consequence,  there  is  then  a limitation  also  on  the  intensity  of  clus- 
tering at  the  center.  This  is  caused  by  the  required  balancing  of 
positive  and  negative  areas  in  Fig.  1. 

Obeying  the  intensity  limitation,  the  elementary  clustering  oper- 
ation is  obtained  by  a direct  integration  of  Eq.  1 with  our  weight. 

The  consequent  mapping  is  then  expressed  with  the  new  parameter  t 
given  as  a monotone  function  of  the  old  parameter  s.  In  correspond- 
ence with  the  linear  segments  of  the  integrand,  t is  expressed  as  a 
piecewise  quadratic  function  of  s.  To  apply  the  mapping,  a uniformly 
distributed  t must  produce  the  desired  non-uniformities  in  s which  in 
turn  are  sent  to  the  curve  by  using  the  old  curve  mapping.  This  is 
just  the  composition  of  going  from  t to  s and  then  to  the  curve.  By 
construction,  however,  we  go  from  s to  t which  is  backward.  This 
means  that  t(s)  must  be  inverted  to  obtain  s(t)  which  is  forward  rath- 
er than  backward  and  thus  can  be  used  directly.  Fortunately,  in  this 
piecewise  quadratic  case,  the  analytical  inversion  is  possible  and  is 
somewhat  simple.  Since  it  contains  radicals,  it  is  not  as  simple  as 
the  original  backward  construction. 

With  the  motivation  towards  more  simplicity  and  higher  levels  of 
clustering  intensity,  we  shall  consider  forward  rather  than  backward 
constructions.  To  accomplish  this,  we  must  invert  our  thinking  about 
weights,  and  thereby,  have  points  attracted  to  locations  where  the 
weight  is  smaller  rather  than  larger  as  would  have  been  expected  when 
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compared  to  the  center  of  gravity  shifts  for  masses.  In  terms  of  the 
pi ecewise-1 inear  construction  depicted  in  Fig.  1,  the  inversion  re- 
sults in  a rigid  reflection  about  the  horizontal  axis  and  a relabeling 
of  that  axis  to  be  for  the  new  parameter  t in  place  of  the  old  s. 

This  is  displayed  in  Fig.  2 and  as  earlier  is  added  to  unity  to  form 
the  weight  w = 1 +f  which  is  now  used  in 

ds  = w dt 

For  notational  consistency,  the  new  parameters  t_  = s_,  tQ  = sQ,  and 


Figure  2:  The  function  which  must  be  added  to  unity 

to  form  a weight  for  the  forward  mapping  with  Eq.  2 


t+  = s+  are  used.  The  equalities  also  result  from  the  rigidity  of  the 


reflection. 


Rather  than  derive  the  algebraic  formulation  directly  from  t_  to 
tQ  and  then  from  tQ  to  t+,  we  first  re-examine  the  basic  constraint 
which  led  to  the  equalities  above;  namely,  that  the  integral  of  f 
over  each  interval  vanishes.  This  constraint  must  be  satisfied  not 
only  by  the  function  displayed  in  Fig.  2 but  also  by  any  function 
which  is  to  be  employed  for  the  same  purpose.  To  begin  our  re-exam- 
ination, we  first  note  that  the  two  integrals  still  vanish,  if  we 
rigidly  translate  the  function  along  the  t-axis.  The  translation  is 
just  the  transformation  from  t to  t+c  from  some  c.  Moreover,  we  also 
note  that  the  vanishing  is  preserved  under  a constant  dilation  or 
contraction  of  either  vertical  or  horizontal  axes.  These  are  just 
transformations  which  scale  an  axis  by  scalar  multiplication.  In  the 
horizontal  case,  it  is  the  transformation  from  t to  at  for  some  a. 

The  effect  of  either  transformation  is  to  multiply  the  vanishing 
integral  by  a finite  constant,  and  thereby,  preserve  the  vanishing. 

In  more  formal  terms,  the  constraint  is  invariant  under  the  groups  of 
transformations  for  translation  and  scaling.  As  a practical  conse- 
quence, we  can  derive  our  algebraic  formulation  with  standard  condi- 
tions for  height  and  interval  and  then  apply  the  transformations  to 
get  the  formulation  for  any  other  conditions  that  we  wish.  This  also 
means  that  the  same  derivation  can  be  used  for  the  intervals  on  each 
side  of  tQ;  and  consequently,  reduce  the  complexity  of  derivation  by 
half.  A further  reduction  comes  from  selecting  the  unit  interval  and 
a unit  height  since  the  arithmetic  will  be  simplified. 

With  the  unit  lengths  for  our  standard  conditions,  we  are  led  to 
consider  the  function  shown  in  Fig.  3 where  the  juncture  point  loca- 
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1 and 


tion  x = a is  arbitrary.  From  a given  downward  unit  f (1)  = - 

a 

the  requirement  for  equal  areas  above  and  below  the  x-axis,  we  find 
that  f must  cross  the  x-axis  at  1/(2-a)  and  have  a value  of  1-a  at 
a.  This  function  is  then  uniquely  determined  by  the  value  at  a which 
is  also  indicated  in  the  figure.  From  the  figure,  the  algebraic  form- 
ulation is  directly  seen  to  be 


f 

a 


Figure  3:  A standard  function  for  the 

construction  of  piecewise  linear  weights 


f«(*) 


(l-o)  (2) 

or 


(1-0)  - <2-a>(£2) 


for  0 £ x ^ a 


for  a < x £ 1 


otherwise 


As  a matter  of  interpretation,  a represents  the  location  of  maxi- 
mal spacing  while  the  intersection  point  1/(2-ot)  represents  the  loca- 
tion where  the  spacing  starts  to  decrease  beyond  the  original  uniform 
spacing.  This  means  that  the  desired  impact  of  clustering  becomes 
significant  only  after  the  intersection  point  since  we  must  first  re- 
cover from  the  large  spacing  at  a.  Thus,  1/(2-a)  is  the  break-even 
point.  As  a varies  through  its  possible  range  from  0 to  1 , the  break- 
even point  varies  f rom  V2  to  1.  In  order  to  provide  a reasonably 
gentle  transition  into  the  smallest  spacing,  it  is  preferable  to  have 
a large  region  for  the  progression  in  spacing  to  occur.  The  largest 
possible  region  would  have  a length  of  V2  and  would  occur  when  a van- 
ishes. This,  however,  would  leave  no' room  for  a smooth  transition 
from  endpoint  spacing  to  the  maximum  spacing  at  a:  a reasonably  sized 

region  is  needed  here  for  the  same  reasons  as  in  the  situation  with 
the  smallest  spacing.  Thus,  a compromise  is  needed.  As  an  example, 
we  consider  the  case  with  a = The  transition  into  large  spacing 
then  occurs  over  a third  of  the  length  while  the  final  compression 
after  the  break-even  point  occurs  over  the  last  40?  of  the  length. 

The  corresponding  function  is  then 


2(3-5x) 


for  0 ^ x < — 
for  I < x £ 1 
otherwise 
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where  for  notational  convenience,  we  have  dropped  the  subscript  of 
1 which  would  have  been  required  from  the  specialization  of  Eq.  3. 

With  the  function  for  ot  = ^ , a weight  for  the  forward  mapping  is 
given  by 


w 


t-t  t -t 

1 + sWfhH  + r(r=t-H 

co  - + co 


(5). 


for  t * tg  and  by  w = 1 - 8 for  t = tQ.  The  special  treatment  of  tQ 
is  required  to  remove  a discontinuity  caused  by  a contribution  of  -1 
from  f on  each  side  when  otherwise  only  one  nonzero  value  would  ap- 
pear. The  coefficient  B is  a control  on  the  intensity  of  clustering. 
In  a direct  sense,  the  spacing  at  the  center  is  scaled  by  1-8  to  pro- 
duce a smallest  spacing  in  s.  This  spacing  comes  from  Eq.  2 which 
gives  (l-B)dt  at  tQ.  For  an  n-point  grid,  it  becomes 
(1-8) (tmax-tmin^/^n-1 ^ ' As  ® varies  from  0 to  1 , the  minimum  spacing 
varies  from  the  original  spacing  down  to  0.  To  avoid  singularities, 
we  do  not  go  down  to  0 but  rather  are  interested  in  coming  arbitrarily 
close  to  0.  Unlike  the  earlier  backward  mapping,  there  is  no  price 
for  this  arbitrary  level  of  clustering  intensity.  This  occurs  because 
the  compensating  areas  for  grid  expansion  are  now  in  the  positive  di- 
rection where  there  is  no  limit  on  size  as  there  previously  was  when 
the  axis  itself  was  being  approached. 

By  use  of  the  weight  of  Eq.  5 in  Eq.  2,  we  obtain  the  forward 
mapping 


s 


+ sKvtjg(gi-)  + (t0-t+)g(^-)} 


(6a) 
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where 


g(x) 


■jj-(1-x)(5x-1 ) 


for  0 £ x < 1 
for  1 S x £ 1 
otherwise 


> 


j 


(6b) 


is  the  integral  of  f for  increasing  x.  The  interval  lengths  multiply- 
ing each  application  of  g result  from  a change  of  variable  in  each 
corresponding  integral.  Geometrically,  g appears  as  a simple  bump 
which  leaves  the  axis  (g(0)  = 0)  with  zero  slope  (g* (0)  = 0),  monoton- 


ically  increases  in  the  positive  direction  to  reach  a maximum,  and 
then  monotonically  descends  back  to  the  axis  (g ( 1 ) = 0)  to  enter  with 
a negative  slope  (g'(1)  = - 1).  When  assembled  in  the  transformation, 
8 scales  a combination  of  positive  and  negative  bumps  which  are  joined 
together  with  matching  nonzero  slopes.  The  addition  to  the  line  s = t 
then  represents  a local  distortion  of  it  which  causes  clustering  but 
which  preserves  the  uniform  spacing  elsewhere.  An  illustration  of  the 
transformation  is  given  in  Fig.  4. 


From  a geometric  viewpoint,  we  have  simply  taken  the  uniform 
transformation  s = t and  have  given  it  a local  clockwise  twist  about 
tQ.  The  severity  of  the  twist  is  controlled  by  the  slope  at  tQ  and  to 
some  extent  by  the  location  of  the  maximum  displacement  from  s = t. 
This  location  corresponds  to  the  break-even  point  where  the  spacing 
from  the  transformation  matches  the  uniform  spacing  from  s = t;  or  in 
other  words,  where  the  two  slopes  match.  In  the  case  of  Eq.  6,  the 
choice  of  a = j led  to  a maximum  displacement  at  x = 0.6.  With  the 
more  general  piecewise  construction,  it  occurs  at  x = 1/(2-a)  and 
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Figure  An  elementary  parameter  transformation 
in  the  forward  direction  of  new  t going  into  old  s 


thus  can  be  controlled  with  the  choice  of  a.  The  analytical  formula- 
tion is  obtained  by  repeating  the  development  with  the  f from  Eq.  3 
in  place  of  f.  Moreover,  the  locations  can  be  controlled  separately 
on  each  side  of  tg  by  using  corresponding  distinct  and  The 

analytical  formulation  is  only  altered  by  replacing  the  two  applica- 
tions of  g in  Eq.  6a  with  the  corresponding  generalizations  g and 

a1 

g of  Eq.  6b. 

°a„ 

2 
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ALTERNATIVE  FORMULATIONS 


While  further  shape  control  over  the  forward  transformation  de- 
picted in  Fig.  4 can  be  exercised  with  more  exotic  piecewise  construc- 
tions, we  shall  instead  examine  some  alternatives  which  offer  less 
shape  control  but  which  present  attractive  options  because  of  their 
simplicity  in  statement.  In  this  regard,  we  first  note  that  the  prev- 
ious piecewise  constructions  achieved  a high  degree  of  algebraic  sim- 
plicity at  the  expense  of  doing  it  in  a number  of  successive  defining 
intervals. 

As  a first  step,  we  shall  consider  formulations  which  reduce  the 
number  of  defining  intervals.  Returning  to  the  unit  interval  on  which 
we  established  f and  then  g,  we  shall  consider  a replacement.  Noting 
that  second-  and  first-order  zeros  for  g are  desired  respectively  at  0 
and  1 , we  are  brought  to  consider  the  simplest  positive  bump  function 

gQ(x)  = xa(1-x)  (7) 

which  satisfies  the  conditions  when  o > 1 and  which  is  defined  by  one 
segment.  The  derivative 

f^Ca)  = xa  1 [a  - (a+1)x]  (8) 

assumes  the  value  of  -1  at  x=1  and  is  also  seen  to  vanish  at  x = 
a/(1+a),  which  by  our  previous  observations  is  the  break-even  point 
with  uniform  spacing.  As  earlier,  the  location  can  be  adjusted  with  a 
choice  of  a.  In  distinction,  this  a alters  the  complexity  of  the 
equation  by  creating  larger  powers  when  we  wish  to  push  the  break-even 
point  towards  1.  By  contrast,  the  original  piecewise  development  re- 
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quired  only  a shift  of  juncture  point  for  the  same  purpose.  The  ap- 
plication of  Eq.  7 to  create  a forward  transformation  is  direct  and  is 
accomplished  by  simply  replacing  the  g in  Eq.  6 with  the  g^  of  Eq.  7. 
This  can  be  done  for  either  one  or  both  of  the  intervals  about  tQ  and 
each  can  have  a separate  adjustable  a.  Because  f (1)  = - 1,  the  con- 
trol over  minimum  spacing  by  6 is  exactly  the  same.  Thus,  while  we 
also  retain  a capability  to  separately  control  the  locations  of  break- 
even points,  we  have  been  able  to  reduce  the  number  of  defining  inter- 
vals: non-zero  values  now  appear  on  two  rather  than  four  intervals. 

In  continuation,  we  seek  a further  reduction  to  a single  interval 
with  non-zero  values.  To  do  this,  we  shall  construct  a function  which 
will  directly  produce  symmetric  clusters.  Rather  than  considering  the 
unit  interval,  we  will  develop  the  function  on  the  larger  interval 
from  -1  to  1.  To  start,  we  form  a symmetric  positive  bump  with  the 
function 


ha(x)  = (1-x)a(1+x)a  (9) 

which  is  attached  to  the  axis  with  vanishing  first  derivatives  when  a 
> 1 and  which  has  a single  maximum  value  of  unity  when  x = 0.  At  this 
stage,  a monotonically  decreasing  function  through  the  origin  is 
needed  as  a factor  to  produce  a negative  slope  at  the  origin  and  to 
split  the  bump  into  a positive  bump  before  origin  and  a negative  bump 
after  the  origin.  If  u(x)  is  such  a function,  then  the  derivative  of 
uha  at  x = 0 is  just  u’(0)  since  ha(0)  = 1 and  h^(0)  = 0.  The  sim- 
plest such  choice  is  to  set  u(x)  = - x.  The  associated  function  is 


then 


(10) 


gQ(x)  = - x(1-x)“(1 +x)a 


and  satisfies  the  properties  ga(±1)  = g^(±1)  = g^CO)  = 0 and  g^(0)  = 
- 1.  For  a cluster  interval  of  length  2T  about  tg,  we  take  x = 
(t-tQ)/T  in  Eq.  10  and  obtain  the  transf ormation 


s = 


t-t 

t + BTg  ( 


l 1 


for  tQ-T  ^ t £ tQ+T 
otherwise 


■\ 


(id 


by  vertically  scaling  the  resultant  bump  pair  by  BT  and  then  adding  it 
to  the  uniform  mapping  s = t.  By  direct  differentiation,  we  have  the 
weight  function  of  unity  everywhere  except  on  the  interval  about  tg 
where  it  assumes  the  form 


w = 1 ♦ B[  (2a+1 ) ("Y"")2  - 1][1  - (^)2]a_1 


(12) 


The  evaluation  at  tg  gives  precisely  the  earlier  clustering  control  & 

which  produces  the  shrinking  factor  of  1-$.  The  motivation  to  get  the 

same  control  came  from  g'(0)  = - 1 and  the  chain  rule  contribution  of 

a 

1 /T  as  a factor.  The  break-even  points  with  uniform  spacing  are  given 
when  w = 1 in  the  interval  about  tQ  and  are  just  tQ  ± T//2ct+1 . As  a 
increases,  these  points  then  symmetrically  approach  tg.  The  largest 
possible  distance  is  bounded  by  T//3  in  correspondence  with  a = 1 
which  is  the  lower  bound  for  a.  Thus,  a can  be  used  to  control  the 
distance  of  break-even  points  from  tg  over  an  interval  from  0 to  T//3. 
To  have  at  least  one-third  of  the  interval  for  clustering,  this  choice 
must  be  for  a between  1 and  4.  Undoubtedly,  variations  on  this  theme 
could  be  executed  both  to  produce  larger  regions  for  the  final  clus- 
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tering  compression  and  to  insert  a desired  amount  of  asymmetry. 

Rather  than  pursue  these  variations,  we  shall  inspect  the  further 
possibility  of  asymptotic  approximation  with  the  desire  to  simply  de- 
fine an  elementary  clustering  transformation  in  one  global  statement 
without  having  to  establish  a particular  clustering  interval.  From 
this  viewpoint,  such  intervals  are  implicitly  defined  when  the  asymp- 
totic decay  is  essentially  complete.  The  impreciseness  here  then 
gives  us  only  a fuzzy  definition.  By  contrast,  however,  we  shall  see 
that  the  earlier  break-even  points  can  be  established  precisely. 

As  in  the  last  case,  we  multiply  a positive  bump  function  by  the 
monotonically  decreasing  function  - x which  passes  through  the  origin. 
To  start,  we  consider  the  bump  function  (1+x2)  a and  arrive  at 


g (x)  = - x ( 1 +x 2 ) “ 


a 


(13) 


which  decays  when  a > 1 . The  uniform  transformation  s = t is  now  al- 
tered for  local  clustering  about  tp  by  setting 


t-tn 

= t + BTgJ-—) 


cuo 


Once  a decay  rate  a is  chosen,  the  length  scale  T is  used  to  appropri- 
ately shrink  or  expand  the  region  of  primary  influence.  By  differen- 
tiation, the  associated  weight  is  given  by 


w = 1 + 6 


(2a-1)(^V  - 1 

[i  ♦ (^Vr1 


(.15) 


This  reduces  to  w = 1-g  at  the  cluster  center  tQ  and  thereby  retains 
the  meaning  of  the  previous  intensity  controls  B.  The  decay  rate  a 
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controls  the  location  of  break-even  points  which  from  Eq.  15  appear  at 
a distance  of  T//2ot-1  on  either  side  of  tg.  An  increase  in  a simply 
causes  a shift  towards  tQ  relative  to  the  scaling  T.  At  the  other  ex- 
treme, as  a approaches  1,  the  shift  is  away  from  tQ  and  is  bounded  by 
T.  Altogether,  adjustments  in  decay  rate  allow  break-even  points  to 
be  located  anywhere  between  0 and  T units  away  from  the  center  tQ.  At 
the  extreme  of  T,  the  effective  clustering  region  is  enlarged  beyond 
T.  To  keep  it,  say  within  T units  of  tQ,  a somewhat  conservative 
choice  is  needed. 

In  the  same  spirit,  we  may  also  repeat  the  asymptotic  construc- 
tion with  notably  different  analytical  formulas.  For  example,  we  may 
decide  that  a better  bump  function  would  be  given  by  the  Gaussian  form 

— CtX  ^ 

e and  would  then  get 

~ ■ ctx  ^ 

Sa(x)  = ~ xe  (l6) 


in  place  of  Eq.  13*  This  would  correspondingly  be  used  in  Eq.  14  with 
the  same  interpretations  for  T and  would  lead  to  the  weight 


w = 1 


e[2a(- 


t“t 


)2  - l]e*p[-  2 ] 


(IT) 


with  the  same  clustering  intensity  control  &.  The  positive  damping 
rate  a is  a control  over  the  location  of  the  break-even  points  rela- 
tive to  T.  These  are  located  at  a distance  of  T//2a  on  either  side  of 


THE  APPLICATIONS  SETTING 

To  describe  the  setting  in  which  applications  are  to  be  perform- 
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ed,  we  take  note  both  of  the  general  topic  of  grid  generation  and  of 
the  order  of  application.  Grid  generation  arose  as  a topic  of  study 
in  response  to  the  need  for  numerical  simulations  of  realistic  physi- 
cal systems.  It  has  now  been  the  subject  of  three  general  reviews  [1] 
- [3],  three  major  conferences  [4]  - [6]  and  one  textbook  [7].  A fun- 
damental part  of  grid  generation  is  the  determination  of  pointwise 
distributions  on  curves.  This  occurs  because  curves  are  basic  con- 
structive elements  in  virtually  any  approach  to  grid  generation.  At 
the  very  least,  they  represent  boundaries  of  two-dimensional  regions 
and  are  typically  used  to  create  bounding  surfaces  for  three-dimen- 
sional regions.  The  pointwise  distribution  on  them  directly  influ- 
ences the  regional  grid  regardless  of  the  method  employed  to  generate 
that  grid.  The  further  redistribution  of  families  of  curves  or  sur- 
faces within  a regional  grid  is  also  a typical  consequence  of  the  re- 
distribution of  points  along  curves. 

To  accomplish  the  redistribution  of  points  along  curves  in  a pre- 
cise manner,  we  have  developed  herein  the  elementary  operation  of  cre- 
ating a single  local  cluster  about  a point.  The  application  of  the 
operation  to  a succession  of  points  can  be  ordered  in  either  of  two 
natural  ways:  the  points  are  taken  one  at  a time  or  they  are  done 

simultaneously.  In  correspondence,  we  may  view  the  first  as  most 
ideally  suited  to  an  interactive  graphical  environment  while  the  sec- 
ond appears  more  attractive  for  an  automatic  approach. 

In  the  interactive  setting,  we  assume  that  someone  is  sitting  at 
a graphics  terminal  or  workstation  with  the  capability  to  view  the 
pointwise  distribution  on  the  curve  and  to  locate  or  insert  pointwise 
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data  by  means  of  a cursor.  For  simplicity,  we  will  assume  that  the 
cluster  center  and  endpoints  are  taken  from  the  existing  grid  points 
on  the  curve  rather  than  at  intermediate  locations  which  would  then 
necessitate  an  interpolation.  With  this  assumption,  the  cursor  is 
used  to  identify  those  grid  points  according  to  their  indices.  Since 
the  grid  on  the  curve  is  the  result  of  mapping  a uniform  grid  in  a 
parameter  s and  since  the  corresponding  uniform  spacing  can  be  taken 
as  unity,  the  indices  directly  give  the  parametric  distance  that  the 
endpoints  s_  and  s+  are  from  the  center  sQ.  If  we  take  sQ  = 0,  then 
s_  and  s+  are  respectively  the  number  of  grid  points  below  and  above 
the  cluster  center.  In  terms  of  our  new  uniform  parameter  t,  this  be- 
comes t_  = s_ , tQ  = 0,  and  t+  = s+.  Next,  the  desired  fractional  de- 
crease in  spacing  1-g  is  chosen  for  the  center.  The  forward  mapping 
from  Eq.  6 (or  any  of  the  equivalent  variants)  is  now  applied  within 
the  interval  from  t_  to  t+  to  produce  a local  cluster  of  points  about 
s0  = 0.  Unlike  the  center  point  and  the  points  outside  this  interval, 
the  clustering  has  caused  points  to  fall  generally  between  the  old 
uniformly  spaced  points  in  s.  If  the  curve  is  given  analytically  in 
terms  of  s,  then  the  old  mapping  is  just  an  evaluation  at  those  in  be- 
tween points.  Otherwise,  for  each  new  position  in  s,  we  must  find  the 
unit  grid  interval  that  contains  it  and  then  linearly  interpolate  the 
old  map  from  s to  the  curve  to  get  the  new  grid  point  location  on  the 
curve.  In  this  process,  there  is  no  need  to  operate  on  the  points 
outside  of  the  cluster  interval  since  they  remain  fixed.  Upon  appli- 
cation of  such  an  elementary  clustering  operation,  the  new  distribu- 
tion is  viewed  and  then  a judgment  to  stop  or  continue  is  made.  If 
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the  previous  distribution  is  stored,  then  there  is  also  the  option  to 
easily  restore  it  should  we  not  like  the  result.  Altogether,  by  ap- 
plying the  elementary  clustering  operations  one  at  a time,  we  are  able 
to  interactively  manipulate  the  pointwise  distributions  on  curves. 

In  the  context  where  the  judgments  for  clustering  are  determined 
automatically  for  a collection  of  locations,  it  is  more  attractive  to 
perform  a single  mapping  rather  than  a succession  of  mappings.  Cer- 
tainly, as  the  cluster  regions  overlap  each  other,  the  successive  map- 
ping approach  becomes  more  repetitious  and  less  efficient.  To  obtain 
a single  mapping,  we  may  proceed  from  either  of  two  viewpoints.  The 
first  is  to  consider  what  would  have  occurred  had  we  done  successive 
mappings.  For  any  given  order  of  mappings,  the  single  mapping  would 
be  a successive  composition  in  the  same  order.  By  applying  the  chain 
rule  at  each  stage,  the  weight  for  the  single  mapping  is  just  the  pro- 
duct of  the  weights  from  the  elementary  cluster  maps.  We  note  that 
the  elementary  clustering  weights  are  of  the  form  w^  = 1 + g^C^  for 

cluster  functions  and  intensities  fL  where  i = 1,  2 n and  n 

is  the  number  of  clusters.  The  weight  for  the  single  mapping  is  then 

W - (i  + B1c1 ) Cl  + B2c2)  ••••  (l  + BnCn)  (18) 

which  is  independent  of  the  order  of  application.  Unfortunately,  the 
product  is  not  particularly  convenient  to  integrate.  As  a conse- 
quence, the  linear  fL -approximation 

w = 1 + g.  C + g C,  + + g C (19) 

11  2 2 n n 

is  preferred  and  is  also  order-independent.  Thus,  the  forward  mapping 
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results  from  Eq.  2 by  adding  to  uniform  t = s the  scaled  bump  pairs 


from  each  as  i = 1 n.  This  superposition  can  then  be 

viewed  in  the  format  of  Fig.  4 where  now  the  single  twist  about  sQ  = 
tQ  is  replaced  by  n of  them.  In  this  context  of  n simultaneous  clus- 
ters, we  note  that  a choice  of  specific  intervals  for  each  results  in 
a detailed  partition  of  s that  can  be  avoided  if  we  employ  asymptotic 
approximations  of  the  nature  discussed  in  the  section  on  alternative 
formulations . 


CONCLUSIONS 

The  capability  to  locally  manipulate  pointwise  distributions  on 
curves  was  established  through  the  introduction  of  an  elementary  oper- 
ation for  locally  clustering  points  about  any  single  point.  The  oper- 
ation was  created  as  a reparametrization  where  the  spacing  between  new 
and  old  parameters  is  prescribed  by  means  of  a weight  function.  Vari- 
ous constraints  upon  the  weights  were  established  and  the  correspond- 
ing transformations  were  examined.  It  was  found  that  the  forward 
transformations  from  new  to  old  are  better  because  the  composition  of 
mappings  is  simpler  and  because  the  clustering  intensity  control  is 
not  limited  as  it  is  in  the  backward  case. 

The  basic  elements  of  construction  were  done  in  the  most  flexible 
manner  by  using  piecewise  linear  weights.  This  gave  piecewise  quad- 
ratic transformations  that  were  nontrivially  defined  over  four  inter- 
vals, and  more  importantly,  gave  the  fundamental  guidelines  for  more 
arbitrary  constructions.  Rather  than  pursue  the  greater  degree  of 
shape  control  that  is  available  from  general  piecewise  polynomial  con- 
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structions,  alternatives  were  presented  to  reduce  the  number  of  inter- 
vals of  definition  and  thereby  simplify  the  statement  of  the  trans- 
formations. This  viewpoint  was  taken  up  to  the  stage  where  endpoints 
of  the  local  cluster  region  were  only  defined  in  a fuzzy  sense  by  us- 
ing asymptotic  forms.  These  are  attractive  due  to  their  simple  global 
expression  in  one  statement  rather  than  in  the  previous  piecemeal 
fashion.  In  summary,  we  first  established  a class  of  transformations 
that  are  suitable  for  elemenatry  clustering  operations  and  then  we  ex- 
plored a broad  range  of  attractive  candidates  from  that  class. 

The  most  obvious  demand  for  the  local  redistribution  of  points 
along  curves  occurs  within  the  topic  of  grid  generation  and  to  some 
extent  provides  a general  applications  setting.  In  a more  particular 
sense,  the  applications  are  considered  to  occur  in  sequence  or  simul- 
taneously. Cases  where  only  certain  parts  are  simultaneous  can  be 
subdivided  into  either  of  these  two  possibilities.  The  sequential 
order  of  application  is  ideally  suited  to  interactive  graphics  while 
the  simultaneous  application  is  well  suited  to  automation. 
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ABSTRACT 

This  work  deals  with  the  problem  of  a boundary  layer  on  a flat  plate 
which  has  a constant  velocity  opposite  in  direction  to  that  of  the  uniform 
mainstream.  It  has  previously  been  shown  that  the  solution  of  this  boundary 
value  problem  is  crucially  dependent  on  the  parameter  which  is  the  ratio  of 
the  velocity  of  the  plate  to  the  velocity  of  the  free  stream.  In  particular, 
it  was  proved  that  a solution  exists  only  if  this  parameter  does  not  exceed  a 
certain  critical  value,  and  numerical  evidence  was  adduced  to  show  that  this 
solution  is  nonunique.  Using  Crocco  formulation  the  present  work  proves  this 
nonuniqueness.  Also  considered  are  the  analyticity  of  solutions  and  the 
derivation  of  upper  bounds  on  the  critical  value  of  wall  velocity  parameter. 
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1 . Introduction 


The  boundary  layer  on  the  upstream-moving  flat  plate  at  zero  incidence 
admits  of  the  classical  similarity  transformation  which  reduces  the  relevant 
partial  differential  equations  to  the  Blasius  equation. 


£"'  + ff"  = 0 
f(0)  = 0 

f'(0)  = -X,  X > 0 
f'(“)  = 1, 


where  f = t|)(x,y)//(2vx) , i}>  being  the  dimensional  stream  function,  and  v 
the  kinematic  viscosity,  and  q = y//( 2vx) . This  equation  can  be  readily 
integrated  once  to  yield 


i.e. , 


f"(ri) 


f"(0)exp 


f(z)dz  , 


f"(n) 


f"(0)exp 


(q  - z)^  f"(z)dz 


using  integration  by  parts  twice.  Obviously,  the  shear  stress  f"(q)  has  the 
same  sign  as  the  skin-friction  at  the  wall,  f"(0).  For  X = 0,  Weyl  proved 
the  existence  and  uniqueness  using  function-theoretical  methods.  For  X < 0, 
Callegari  and  Friedman  and  Callegari  and  Nachman  found  it  expedient  to  work 
with  the  Crocco  formulation,  that  is,  in  terms  of  shear  stress  g(=f")  as  the 
dependent  variable  and  tangential  velocity  u(=f")  as  the  independent 
variable: 


558 


g(u)g"(u)  + u = 0,  -X  < u < 1, 


g'(-A)  = 0 
g(l)  = 0. 


For  \ 0,  they  proved  existence,  uniqueness,  and  analyticity  of 

solutions  to  Eq.  (2)  using  an  analytical  function  theory  approach.  For  the 

case  X > 0,  Hussaini  and  Lakin  proved  that  a solution  exists  only  for  X 

less  than  a critical  value  Ac>  Their  numerical  results  showed  nonuniqueness 

for  A < X , and  the  numerical  value  of  X was  found  to  be  0.3541...  . In 
c c 

this  work,  the  nonuniqueness  is  established  rigorously.  Also,  proof  of 
analyticity,  and  absolute  monotonicity  etc.,  is  given.  Certain  analytical 
upper  bounds  on  X are  established. 

For  convenience,  we  use  the  transformation  x = u + X to  map  the 
interval  -X  < u < 1 , to  0<x<l+X.  So  we  consider  the  equations 


g(x)g" (x)  + (x  - X)  = 0,  0 < x < 1 + X (1.1) 


g'(0)  = 0 


g(l  + X)  = 0. 


(1.2) 


2.  Analiticity  of  Solutions 

In  this  section,  the  following  basic  result  will  be  proved: 

THEOREM  1:  There  is  a range  of  positive  values  of  X such  that  the 

positive  continuous  solution  g(x)  of  the  boundary  value  problem  (1.1)  and 
(1.2)  is  analytic  on  the  closed  intervel  [0,1  + X]. 
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This  theorem  will  be  proved  by  considering  a sequence  of  lemmas.  The  first 
lemma  required  is : 


LEMMA.  Is  The  derivative  g^(x)  vanishes  at  one  and  only  one  point  on 

the  interval  0 < x < 1 + X.  Further , g(x)  has  its  maximum  value  at  this 

point. 

Proof  of  Lemma  1 : Equation  (1.1)  can  be  integrated  using  the  initial 

condition  g^(0)  =0  to  give 


s'(x)  l mrd*’  (2,1) 

Thus,  as  the  initial  value  a = g(0)  > 0,  both  g(x)  and  g'(x)  are  positive 
for  0 < x ^ X.  Also, 

g" (x)  = (X  - x) / g(x)  (2.2) 

is  positive  for  0 < x < X and  g"(X)  = 0.  The  continuous  solution  g(x) 

remains  positive  for  X < x < 1 + X,  and  hence  g"(x)  is  now  negative.  This 

gives  that  g'(x)  is  a monotone  decreasing  function  for  x > X.  As 
g'(l  + X)  = there  must  thus  be  at  least  one  point  on  the  interval 

( X , 1 + X)  at  which  g'(x)  vanishes.  In  fact,  assuming  that  g'(x)  vanishes 
at  more  than  one  point  leads  to  a contradiction,  for  suppose  that  g'  vanishes 

at  both  x^  and  ^ with  x^  < X£.  Then,  g"  would  have  to  vanish  at  least 

once  between  these  two  points  which  is  impossible  as  g"  < 0 for  x > X.  The 
proof  of  Lemma  1 is  concluded  by  noting  that  g"(xj)  < 0 implies  that 
g(xj)  must  be  the  maximum  value  of  g(x). 
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LEMMA.  2:  The  solution  g(x)  has  a convergent  power  series  expansion  on 


the  closed  interval  [x^,l  + X], 


Proof  of  Lemma  2 ; As  g(x)  is  positive  and  differentiable  for 
Xj  x < 1 + X,  equation  (2.2)  shows  that  g(x)  has  derivatives  of  all 
orders  on  this  interval.  Further,  expressions  for  these  derivatives  may  be 
obtained  directly  from  the  differential  equation  (1.1).  Induction  shows  that 
for  n > 1 , derivatives  of  g(x)  satisfy  the  recursion  relation 


g(n+3)  . . i j 


(n+1 )g'  g(n+2)  + 


1 n+ 1 r 

i 1 I'- 


n+1 

k+3 


) + ("t‘)  S 


(k)l 


r ( n-k+3 ) | 


(2.3) 


where  g(^)  is  the  k-th  derivative  of  g with  respect  to  x 
the  usual  combinatorial  symbol. 

Let  g(x^)  = B,  and  consider  the  auxilliary  function  G(x) 


and  fp)  is 

'■q 1 

defined  by 


G(x)  = B - g(x) . 


(2.4) 


Then,  as  B is  the  maximum  value  of  g(x),  G(x)  is  non-negative  for 
Xj  < x < 1 + X.  Also,  for  all  n 1 , Gn(x)  = -g(n)(x).  Consequently, 
equation  (2.1)  shows  that  G'(x)  is  positive  on  the  interval  x x < 1 + X. 
From  (1.1), 


G"(x)  = f (ir  and  G’ "'(x) 


1 + G"  G" 
g(x) 


are  also  both  positive  on  this  interval.  The  recursion  relation  (2.3)  thus 
shows  that  all  derivatives  of  G(x)  are  non-negative  on  the  closed  interval 
[x^,l  + X - e]  where  l+X-x^>e>0.  Hence,  G(x)  is  absolutely 
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monotonic  on  this  closed  interval,  A theorem  of  Bernstein  [4]  now  gives 
that  G(x)  has  a convergent  Taylor  series  expansion  about  the  point  x^ 
whose  radius  of  convergence  is  not  less  than  1 + A - x . From  the  definition 
of  G(x),  it  immediately  follows  that  for  |x  - Xj|  < 1 + A - x^ , g(x)  has 
the  convergent  expansion 


g(x) 


I 

n=0 


g^CXj) 

n! 


(2.5) 


Application  of  a Tauberian  theorem  [5]  further  shows  that  the  power  series 
(2.5)  converges  at  the  singular  point  x = 1 + A to  the  value  g(l  + A)  = 0 
completing  the  proof  of  Lemma  2. 

To  establish  Theorem  1,  it  must  be  shown  that  for  a nontrivial  range  of 
positive  values  of  A,  the  power  series  (2.5)  for  the  solution  g(x)  of  the 
boundary  value  problem  (1.1)  and  (1.2)  converges  at  the  left  boundary  point 
x = 0.  This  will  be  accomplished  in  Lemma  3.  A consequence  of  this 

convergence  will  be  an  expansion  for  the  initial  value  of  g(x)  as  the  series 

oo  ( - 1 ) n xn 

« = B+  I g(n)(x  ).  (2.6) 

n=2  1 

LEMMA  3:  There  exists  a positive  value  T such  that  if  0 < X < T 

then  Xj  < (1  + X)/2. 


Lemma  3 gives  that  the  left-hand  boundary  point  x = 0 lies  inside  the  radius 
of  convergence  of  the  power  series  expansion  (2.5).  Consequently,  the 
corresponding  solution  of  the  boundary  value  problem  will  be  analytic.  It 
should  be  noted  that  the  upper  bound  on  x1  given  in  Lemma  3 is  a sufficient, 
but  not  a necessary,  condition  for  convergence. 
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Proof  of  Lemma  3:  Equation  (1.1)  may  be  integrated  from  0 to  x using 


the  identity  gg"  = (gg')'  - 
second  integration  from  0 to 

xJ(Xl  - 3X) 

■ — — ■ ■ « - = 

6 


(g')2  and  the  initial  condition 
Xj  now  gives  the  result 

2 2 X1 

” ' v ~ + f (x,  - 5)g'2(Od?. 

0 1 


g'(0) 


0.  A 


(2.7) 


An  upper  bound  on  the  right-hand  side  of  (2.7)  and  a lower  bound  on  the 
maximum  point  x^  are  now  required  to  establish  the  lemma. 

A lower  bound  on  x^  may  be  obtained  by  using  (2.1)  and  the  fact  that 
g'(x^)  =0  to  obtain 

l mr ds " { 1 !r?r “5-  (2-8) 

As  g(x)  is  monotone  increasing  on  [0^],  g(x)  £ g(X)  on  [0 , X ] , but 
g(X)  £ g(x)  on  [X,Xj],  Equation  (2.8)  now  gives 


2X. 


(2.9) 


As  g(x)  has  its  only  maximum  at  Xj  by  Lemma  1,  an  immediate  lower 
bound  on  g(x^)  =8  is  8 > a.  A sharper  lower  bound  on  8 can  be  obtained 
from  the  expression 


8 


a + 


(x  - 0(X  - 5) 

— mr—ds 


0 


(x  -.Q2 

g(0 


(2.10) 


obtained  by  integrating  (2.1)  from  0 to  xj.  As  g(x)  £ 8,  and  by  (2.9), 
Xj  - X _>  X,  equation  (2.10)  now  gives  the  quadratic  inequality 
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(2.11) 


g2  - a6  - 2j-  > 0 


which  implies 


3 > 


a + / a2  + 8X3 


/ 3 


(2.12) 


2 2 

A lower  bound  on  8 - a which  follows  from  (2.12)  is  thus 


.2  2 S2X 

3 - a > 


- 3 * 


(2.13) 


Consider  next  bounds  on  the  initial  value  a.  Let  X = 1 + X.  Then, 
integrating  (2.1)  from  0 to  X and  using  g(X)  = 0 gives 


X 


« “ / 
0 


(x  - 5)(£  - X) 



d£. 


(2.14) 


This  relation  may  be  rewritten  in  terms  of  strictly  positive  integrals  as 


fX  (x  - 5)(e  - x)  ,X  (x  - 5)(X  - 5)  dc 

° J J xvyy\ d5 


(2.15) 


which  shows 


« < / 

X 


(x  - £)(S  - x) 

gCT5 


dC. 


(2.16) 


The  convexity  of  g(x)  on  [X,X]  implies  that  on  this  interval 
g(x)  ^g  (X)»(X  - x).  Equation  (2.16)  now  gives  that  a < (2g(X))~*. 
a < g(X),  this  further  implies 

a2  < V2  • 


As 


(2.17) 
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Equation  (2.15)  does  not  lend  itself  to  the  derivation  of  a lower  bound 

2 

on  oc  • However,  in  the  present  consideration  of  analyticity,  the  required 
bound  can  be  obtained  from  a relation  between  a and  g which  follows  from 
the  existence  proof  of  Hussaini  and  Lakin  [3].  This  proof  shows  that  if  X 
is  positive  and  does  not  exceed  a critical  value,  there  is  at  least  one 
initial  value  a such  that  a positive  continuous  solution  of  the  initial 
value  problem  consisting  of  (1.1)  and  the  conditions  g(0)  = a and  g"(0)  = 0 
exists  and  has  g(X)  = 0,  i.e.,  it  is  a solution  of  the  boundary  value 

problem.  Further,  the  solution  of  the  initial  value  problem  will  be  unique  if 

g < 2a.  (2.18) 

It  must  be  noted  that  a unique  solution  of  the  initial  value  problem  dn«*  not 
imply  a unique  solution  of  the  boundary  value  problem.  This  will  be  shown  in 
section  4. 

2 

A lower  bound  on  a follows  by  using  (2.18)  in  (2.12).  The  result  is 

2 X3 

a > y-  . (2.19) 

The  final  bound  needed  for  use  in  equation  (2.7)  is  an  upper  bound  for 
g'(x)  on  the  interval  [OjXj].  From  (2.2),  g"(x)  is  a monotone  decreasing 
function  on  this  interval.  Further,  g"(X)  = 0 while  the  third  derivative 
of  g is  negative  when  x = X.  Thus,  g"(x)  has  its  maximum  value  at  x = X. 
This  implies  that  on  [0,Xj] 

X , _ - 

0 < g'(x)  < g'(\)  = f d?*  (2.20) 
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As  g(x)  _>  a on  C 0 , A ] , equation  (2.20)  gives 


0 < g'(x)  < ^ 


(2.21) 


An  upper  bound  on  the  integral  in  equation  (2.7)  is  thus 

X1 

f (xx  - 5)g'2(C)d?  <|  Xx2 . (2.22) 

Use  of  (2.13)  and  (2.22)  in  equation  (2.7)  implies 

x2(xi  - X)  + 2X3  < 0.  (2.23) 

This  relation  gives  that  Xj  will  be  less  than  X/2  for  X in  the  range 
0 < X < X = 0*1176.  The  sufficient  condition  for  analyticity  is  thus 
satisfied  for  a range  of  positive  values  of  X establishing  Lemma  3 and 
Theorem  1 • 

Equation  (2.9)  implies  that  x^  cannot  be  less  than  X/2  if  X > 1/3. 
Indeed,  direct  numerical  solution  of  the  boundary  value  problem  shows  that 

A 

< X/2  when  X < X - 0.32  and  a lies  on  the  upper  branch  in  Figure  1.  The 
gap  between  the  values  of  X and  X is  associated  with  fundamental  problems 
in  obtaining  sharper  bounds  on  the  initial  value  a.  For  example,  equation 
(2.15)  implies 


« < / 
X 


(x  - g)(S 

gT§T 


- x) 


d 5 + / 


(s  - x) 

g(S) 


d§  - 


ix  <«  - »2  dE 

l g<5>  d?' 


(2.24) 
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Individually,  the  last  two  integrals  in  (2.24)  are  formally  infinite,  yet  they 

must  cancel  so  as  to  give  an  order  one  upper  bound.  Direct  numerical 

2 2 

calculations  show  that  the  upper  bound  on  u is  a < 0.219961.  The  upper 
bound  in  (2.17)  is  thus  conservative  by  over  a factor  of  two. 

It  must  again  be  noted  that  x^  < X/2  is  only  a sufficient  condition  for 

analyticity.  For  values  of  a on  the  upper  branch  of  Figure  1,  solutions  of 
the  boundary  value  problem  can  thus  be  expected  to  remain  analytic  for  X 

A 

greater  than  X,  Further  insight  can  be  gained  by  examining  parameter  values 
for  which  the  condition  (2.18),  which  is  sufficient  for  a unique  solution  of 
the  associated  initial  value  problem,  is  maintained.  Numerical  results  show 
that  (2.18)  holds  for  all  values  of  a on  the  upper  branch  of  Figure  1,  It 
also  holds  for  a on  the  lower  branch  of  Figure  1 in  the  relatively  small 

range  0,351  <X  < Xc  and  is  violated  over  the  remainder  of  the  lower 

branch.  The  behavior  of  3 as  a function  of  a is  given  in  Figure  2.  For 
values  of  X associated  with  initial  values  on  much  of  the  lower  branch  of 
Figure  1,  there  must  thus  be  serious  doubts  as  to  whether  solutions  of  the 
boundary  value  problem  (1.1)  and  (1.2)  are  analytic. 


3.  An  Upper  Bound  on  X 

c 

The  existence  proof  of  Hussaini  and  Lakin  [3]  established  the  existence 

of  solutions  of  (1.1)  and  (1.2)  for  positive  values  of  X less  than  a 

critical  value  X • It  was  shown  from  (1.1)  and  (1.2)  that  X < 1/2.  The 
c c 

value  of  X^  was  also  determined  numerically  in  that  work  to  be 

Xc  - 0.3541079...  (3.1) 
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In  this  section,  additional  upper  bounds  for  X will  be  obtained  directly 
from  (1.1)  and  (1.2). 

Using  the  identity  that  precedes  equation  (2.7),  equation  (2.1)  can  be 
integrated  from  0 to  x and  the  result  integrated  again  from  0 to  X. 
As  g(X)  » 0,  this  gives 

- 3X)  . ^ t ^ 6 2 (3.2) 

6 2 0 

The  right-hand  side  of  (3.2)  is  intrinsically  positive,  and  thus 

X - 3X  > 0.  (3.3) 

This  relation  immediately  implies 

X<V2.  (3.4) 

To  obtain  sharper  bounds  now  requires  the  use  of  positive  lower  bounds 

2 

for  a and  the  integral  in  (3.2).  While  no  additional  assumptions  are 

required  to  obtain  (3.4),  in  what  follows  it  will  be  necessary  to  assume  that 
3 < 2a.  However,  as  noted  previously,  this  condition  is  satisfied  on  the 
entire  upper  branch  in  Figure  1.  In  particular,  it  is  satisfied  in  the 
limiting  case  when  X = Xc* 

Let  the  integral  I(x)  be  defined  by 

x ? 

I(x)  = / (X  - 5)g  (5)d5.  (3.5) 

0 

Then,  as  I(X)  > 0,  equation  (3.2)  implies 
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X2(X  - 3X)  > 3a2.  (3.6) 

Replacing  X by  1 + X and  using  (2.19)  now  gives  the  inequality 

3X3  + 3X2  - 1 < 0 (3.7) 

which  yields  the  improved  bound 

X < 0.47533.  (3.8) 


A slightly  sharper  bound  can  be  obtained  by  noting  that  I(X)  > 1(X). 
Let  5 = g(X).  Then,  g(x)  <6  on  [0,X],  so  on  this  interval 


g'2(x)  > (2\  - x)2.  (3.9) 

46Z 

This  leads  to  the  relation 

x5 

KX)  > — ^ (5X  + 16).  (3.10) 

1205Z 


An  upper  bound  on  6 now  follows  from  the  fact  that  g(x)  a on  [0,X]  and 


6 


a 


(?  " X)2 
' g"(D 


dC. 


In  particular, 


6 < 


3 

2X  + 1 


- 2 


(3.11) 


(3.12) 


Use  of  (3.12)  in  (3.10)  then  shows 
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(3.13) 


I(X)  > 


X5(5X  + 16) 
60(2X3  + 1) 


Equation  (3.2)  now  gives 


X2(X  - 3X)  > 3a2  + 6I(X) 


(3.14) 


which  leads  to  the  inequality 


65X6  + 


76X5  + 


10X3  + 30X2  - 10  < 0. 


(3.15) 


The  solution  of  (3.15)  is 


X < 0.46824 


(3.16) 


which  is  only  a marginal  improvement  over  (3.8). 

Even  if  the  lower  bound  on  I(x)  is  further  sharpened  by  considering 
this  integral  on  the  full  interval  [0,X],  a significant  decrease  in  the  bound 
on  X is  not  obtained.  Again,  this  is  due  to  the  difficulties  associated 
with  obtaining  sufficiently  sharp  bounds  on  the  initial  value  a. 


4.  Non-uniqueness  of  Solutions  for  0 < X < X 

c 

Using  direct  numerical  results,  Hussaini  and  Lakin  [3]  have  shown  that  if 
X is  positive  and  less  than  X£  then  solutions  of  the  boundary  value 
problem  are  not  unique.  For  a fixed  value  of  X in  this  range,  as  shown  in 
Figure  1 there  are  two  initial  values  a which  lead  to  solutions  of  the 
boundary  value  problem.  The  purpose  of  this  section  is  to  prove  this  non- 
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uniqueness  directly  from  (1.1)  and  (1.2).  To  this  end,  it  is  convenient  to 
consider  the  normalized  initial  value  problem 

hh"  + t - L = 0,  (4.1) 

h(0)  = 1,  h'(0)  = 0 (4.2) 

obtained  from  the  initial  value  problem  for  g(x)  by  taking 

g(x)  = ah(t)  with  x = a2^3  t.  (4.3) 

The  parameter  L in  (4.1)  is  related  to  a and  X through  the  expression 

L = cf2/3  X.  (4.4) 


If  h(T)  = 0 and  a(X)  is  given  by 


a = {(1  + X)/T}3/2, 


(4.5) 


then  g(X)  =0,  so  the  solution  of  the  initial  value  problem  with  initial 
value  (4.5)  will  also  be  a desired  solution  of  the  boundary  value  problem. 
Equations  (4.3)  through  (4.5)  also  imply  that  in  terms  of  T and  L 


and 


(4.6) 


a = (T  - L) 


-3/2 


(4.7) 
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LEMMA.  4:  Let  h^(t)  and  h2(t)  be  solutions  of  the  initial  value 

problem  (4.1)  and  (4.2)  corresponding  to  L values  and  L2 , 

respectively.  Then,  if  L2  > Lp  h2(t)  > h^t). 

Proof  of  Lemma  4:  For  t « L,  h(t)  must  be  of  the  form  1 + Lt2/2. 

Thus,  the  lemma  holds  for  small  values  of  t.  That  it  holds  for  0 < t < T 

can  now  be  shown  by  contradiction.  Let  t be  the  first  value  of  t at  which 

hjCt)  = h2(t).  As  hj  was  previously  less  than  h2,  this  requires 
h£(t)  < But, 

Lo  t t L,  — t 

h2(t>  — ~ > h"(t).  (4.8) 

h2(t)  hx(t)  hj(t)  1 

This  contradiction  establishes  Lemma  4.  Lemma  4 also  shows  that  if 
hjCT^  - 0 and  h2(T2)  = 0,  then  h2(Tj)  > 0.  This  implies  that: 

COROLLARY:  T2  > Tj . 

The  derivative  h'(t)  is  given  by  an  expression  analogous  to  equation 
(2.1).  As  h(0)  is  positive,  both  h(t)  and  h"(t)  will  be  positive  for 
0 ^ t <_  L.  This  shows  that  T > L.  Consequently,  the  denominators  in  (4.6) 
are  strictly  positive.  The  following  lemma  gives  a sharper  result: 

LEMMA  5:  T > 3L. 

Proof  of  Lemma  5:  Equation  (4.1)  may  be  integrated  twice  from  0 to 

t using  (4.2)  to  give 
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(4.9) 


3 2 t 

l/2h2(t)  +|--M_  = l/2+  / (t  - Oh'2(?)d?. 

This  implies 

h2(t)  + -i-  t2(t  - 3L)  > 0.  (4.10) 

Setting  t ■ T and  h(T)  = 0 now  establishes  the  lemma. 

Consider  next  the  behavior  of  T as  a function  of  L.  It  has  already 
been  shown  in  Lemma  4 that  T is  a monotone  increasing  function  of  L. 

LEMMA  6:  T(L)  is  superlinear  in  L. 

Proof  of  Lemma  6 : Let  t^  be  the  point  at  which  h^(t^)  = 0.  As  is  the 

case  for  the  original  initial  value  problem  in  the  variable  x,  there  is  one 
and  only  one  such  point,  it  lies  in  the  interval  L < t < T,  and  h(tj)  is  a 
maximum  value. 

Equation  (4.1)  may  be  multiplied  by  h'  and  divided  by  h to  give 

hh"  + h"(th~  L)  = 0.  (4.11) 

Integration  from  0 to  t produces  the  result 

7 t 

V2  Iv  + (t  - L)lnh(t)  - / lnh(5 )d£  = 0.  (4.12) 

0 

Evaluating  (4.12)  at  t^  now  shows 
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t 


1 


/ lnh(Od? 

0 

lnh(t  ) * 


(4.13) 


Next,  the  expression 

h(t)  = 1 + / -■  " h(g)  ~ ~ (4.14) 

may  be  evaluated  at  t = L to  give  an  expression  for  h(L)* 

h(L)  = 1 + 1 d?>  (4-15) 

As  h"(t)  is  non-negative  on  the  interval  [0,L],  h(t)  is  monotone 
increasing,  so  h(t)  jC  h(L),  Use  of  this  fact  in  (4*15)  gives  the  quadratic 
inequality 

3 

h2 (L)  - h(L)  - > 0 (4.16) 

which  implies  h2(L)  L^/3. 

tj.  Consequently, 

One  additional  bound  is  needed  before  demonstrating  the  superlinear 
behavior  of  T(L).  The  change  of  concavity  of  h(t)  on  the  interval  [0,t^] 
due  to  the  fact  that  hn(L)  = 0 precludes  obtaining  as  a lower  bound  for  h 
on  this  interval  the  straight  line  which  passes  through  the  origin  and  the 
point  (tphCtj)),  i.e.,  it  cannot  be  shown  that  h(t)  > h(tj)*t/tlt  However, 
for  a given  L,  it  is  clear  that  h(t)  can  be  bounded  below  on  this  interval 
by  a curve  of  the  form 


The  solution  h(t)  has  its  maximum  value  at 

h(ti)  > Z\~  • (4*i?) 
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h(t.)tk 

H(t;k) i (4.18) 

for  a value  of  k > 1.  As  k increases,  these  curves  become  progressively 
more  convex.  It  should  be  noted  that  if  H(t,k)  provides  a lower  bound  on 
[Ojtj]  for  the  solution  of  (4.1)  and  (4.2)  associated  with  L * L,  then,  by 
Lemma  4,  H(t,k)  also  provides  a lower  bound  for  solutions  associated  with 
larger  values  of  L. 

This  lower  bound  for  h(t)  on  [0,t^]  may  be  used  to  obtain  an  lower 
bound  for  the  integral  in  equation  (4.13).  In  particular, 

t 

/ lnh(£ )d£  > t lnh(t  ) - kt  . (4.19) 

0 ill 

Equation  (4.13)  now  implies  that 

t1>^lnh(t1).  (4.20) 

Use  of  (4.17)  then  gives 

1 > ci  ln  (773)-  (4-21> 

The  superlinear  behavior  of  T(L)  is  thus  established. 

THEOREM  2:  For  positive  values  of  X in  the  range  0 < X < Xc> 

solutions  of  the  boundary  value  problem  (1.1)  and  (1.2)  are  not  unique. 
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Proof  of  Therein  2:  Consider  the  behavior  of  L as  a function  of  X.  By 

(4,4),  L(0)  = 0.  Equation  (4*6)  and  the  superlinear  behavior  of  T with 
repect  to  L shown  in  Lemma  6 now  imply  that  the  graph  of  X vs  L must  be 
as  in  Figure  3.  In  particular,  for  a fixed  positive  X which  is  less  than 
Xc,  there  will  be  two  distinct  values  of  L.  By  the  corollary  to  Lemma  4, 
each  value  of  L must  correspond  to  a different  value  of  T.  Equation  (4.5) 
now  shows  that  for  the  fixed  value  of  X,  two  distinct  values  and 

exist  such  that  the  solutions  of  the  initial  value  problems  with  these  a^s 
are  solutions  of  the  boundary  value  problem  (1.1)  and  (1.2).  Solutions  of  the 
boundary  value  problem  are  thus  not  unique  completing  the  proof  of  Theorem  2. 
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Figure  1 . 


Values  of  the  parameter  a = f"(0)  for  which  f'(~)  = 1 as  a 
function  of  X . 
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Figure  2. 


Values  of  the  maximum  value  g of  g(x)  as  a function  of  the 
initial  value  g(0)  = a.  The  dotted  line  is  g = 2a. 
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Figure  3. 


The  qualitative  behavior  of  the  parameter  L in  the  initial 
value  problem  (4.1)  and  (4.2)  as  a function  of  \. 
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Abstract 

The  mathematical  properties  of  the  pressure-velocity  and  vorticity- 
velocity  formulations  of  the  equations  of  viscous  flow  are  compared.  It  is 
shown  that  a vorticity-velocity  formulation  exists  which  has  the  interesting 
property  that  non-inertial  effects  only  enter  into  the  problem  through  the 
implementation  of  initial  and  boundary  conditions.  This  valuable  characteris- 
tic, along  with  other  advantages  of  the  vorticity-velocity  approach,  are 
discussed  in  detail. 
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Two  distinctly  different  approaches  have  been  utilized  in  the  literature 
for  the  numerical  solution  of  the  equations  of  viscous  flow  in  three- 
dimensions,  In  the  more  common  approach,  the  momentum  equation,  which 
contains  both  the  velocity  and  pressure,  is  solved  numerically  along  with  a 
derived  Poisson  equation  for  the  pressure  (i.e.,  the  pressure-velocity  or 
primitive  variable  formulation  [1-3]).  The  alternative  approach  is  based  on 
eliminating  the  pressure  from  the  momentum  equation  by  the  application  of  the 
curl.  In  this  manner,  a vorticity  transport  equation  is  solved  numerically  in 
lieu  of  the  momentum  equation  (i.e.,  the  vorticity-velocity  formulation  [4- 
6]).  The  purpose  of  the  present  note  is  to  explore  in  more  detail  the 
properties  of  these  disparate  numerical  approaches.  It  will  be  shown  that  the 
vorticity-velocity  formulation  has  a striking  advantage  when  applied  to 
problems  in  non-inertial  frames  of  reference.  More  specifically,  there  exists 
an  intrinsic  vorticity-velocity  formulation  wherein  all  non-inertial  effects 
(arising  from  both  the  rotation  and  translation  of  the  frame  of  reference 
relative  to  an  inertial  framing)  only  enter  into  the  solution  of  the  problem 
through  the  implementation  of  initial  and  boundary  conditions.  This  is  in 
stark  contrast  to  the  pressure  - velocity  formulation  where  non-inertial 
effects  appear  directly  in  the  momentum  equation  in  the  form  of  Coriolis  and 
Eulerian  accelerations — a state  of  affairs  which  can  give  rise  to  a variety  of 
numerical  problems  [2].  A detailed  exposition  of  this  interesting  property  of 
the  vorticity-velocity  formulation  will  be  presented  along  with  a brief 
discussion  of  other  advantages  of  this  approach. 

For  simplicity,  we  will  restrict  our  attention  to  the  analysis  of  viscous 
incompressible  flow  governed  by  the  Navier-Stokes  equation  and  continuity 
equation  which,  respectively,  take  the  form 
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(1) 


where  v is  the  velocity  vector,  p is  the  pressure,  and  v is  the  kinematic 
viscosity  of  the  fluid.  Here,  the  validity  of  (1)  requires  that  the  external 
body  forces  be  conservative  and  that  the  frame  of  reference  be  inertial.  In 
an  arbitrary  non-inertial  frame  of  reference  (which  can  rotate  with  a time- 
dependent  angular  velocity  £2(t)  and  translate  with  a time-dependent  velocity 
Vp(t)  relative  to  its  origin  0),  the  Navier-Stokes  equation  takes  the  more 
complex  form  [7] 


+ v • Vv+ftx  r+ftx  (fl  x r)  + vft  + 28  x v " - Vp  + vv  v. 

d £ ~ ~ rv/  ~ ryj  /v»  ryj  (V  /n>  ryj  /-sy  /%y 


Here,  r is  the  position  vector  and  the  non-inertial  terms  on  the  left-hand 
side  of  (3)  are,  respectively,  referred  to  as  the  Eulerian,  centrifugal, 
translational,  and  Coriolis  accelerations.  The  continuity  equation  still 
assumes  the  same  form  (2)  in  any  non-inertial  frame  of  reference. 

By  the  introduction  of  a modified  pressure  P which  includes  the 
centrifugal  and  translational  acceleration  potentials,  the  non-inertial  form 
of  the  Navier-Stokes  equation  (3)  can  be  simplified  considerably.  More 
specifically,  (3)  can  be  written  in  the  equivalent  form 


9v  # 2 

■r—  + v • Vv  + nxr+2£}xv  = -VP  + vVZ  v, 

O t ^ /v  aj  rv  /v  (V  rsj  * 


(4) 


where 
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p = P + j (S  * l)2  ~ 7 r2  + 


(5) 


• r • 

rsj 


In  the  pressure-velocity  formulation,  equation  (4)  is  solved  in  conjunction 
with  a Poisson  equation  for  the  pressure  which  is  obtained  by  taking  the 
divergence  of  (4).  Hence,  the  governing  equations  to  be  solved  numerically  in 
this  approach  can  be  summarized  as  follows: 


8v 

Jt 


+ 


V 


* 2 

Vv+fixr+2nxv=-VP+vV  v. 


? 

V P = 


- trfVv  • Vv)  + 2J2 

> r>ursj  / ryj 


(6) 

(7) 


subject  to  the  initial  and  boundary  conditions 

2 = ’ at  = V 


(8) 


V 

p 


on  B • 


(9) 


In  (7)  and  (9),  tr(*)  denotes  the  trace,  u>  is  the  vorticity  vector,  and  B 
denotes  the  boundary  surface  of  the  region.  Of  course,  equations  (6)  and  (7) 
must  be  solved  subject  to  the  continuity  equation  (2).  Since  we  are 
considering  general  three-dimensional  flow,  a stream  function  solution  does 
not  exist.  Hence,  the  solution  for  the  velocity  v must  be  projected  in  some 
suitable  fashion  onto  the  space  of  solenoidal  vectors. 

It  is  quite  clear  that  the  form  of  (6)  and  (7)  (and,  hence,  their 
mathematical  character)  change  depending  on  whether  or  not  the  frame  of 
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reference  is  inertial.  Consequently,  a particular  numerical  algorithm  which 
may  be  optimal  for  a given  class  of  flows  in  an  inertial  frame  of  reference 
may  not  be  so  for  the  same  class  of  flows  in  a non-inertial  framing.  It  will 
now  be  demonstrated  that  the  vorticity-veloclty  formulation  does  not  suffer 
from  this  deficiency. 

The  vorticity-velocity  formulation  is  based  on  the  vorticity  transport 

equation  which  is  obtained  by  taking  the  curl  of  (4).  This  equation  takes  the 

form 

9~  2 

vr  + v • Vco  = to  • Vv  + vV  m + 29.  • Vv  - 20,  (10) 

in  any  non— inertial  frame  of  reference  where 

a)  = V x v (11) 

rv  /V  (V 

is  the  vorticity  vector.  It  is  clear  that  the  velocity  and  vorticity  are  also 
connected  through  the  Poisson  equation 

V2  v = - V X (0  (12) 

which  is  a direct  consequence  of  the  vector  identity 

V X (v  X v)  = V(V  • v)  - v2  v.  (13) 

/v  fKf  (V  A/  /S<  /V  /V 

The  intrinsic  vorticity  W,  defined  by 

W = u>  + 2Q,  (14) 

rsj  rv»  /w  * 
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can  be  introduced  which  represents  the  vorticity  relative  to  an  inertial  frame 
of  reference.  Since  g is  spatially  homogeneous  (i.e.,  TO  = 0),  it  Is  a 
simple  matter  to  show  that  the  non-inertial  form  of  the  vorticity-velocity 
formulation  can  be  written  as  follows: 

3W  „ 

-21  + v . VW  = W • Vv  + W W (15) 

v2  V = - v X W.  (16) 

A/  /sj 

Equations  (15)  - (16)  must  be  solved  (in  some  region  R with  a boundary 
surface  B)  subject  to  the  initial  and  boundary  conditions 


W 


<£  x + 


2n 


o’ 


at  t 


(17) 


v = v_ 
~ ~B 


w = (V  X v)D  + 2 n 


on  B. 


(18) 


Of  course,  it  is  well  known  that  the  vorticity,  as  well  as  the  intrinsic 
vorticity,  are  solenoidal,  i.e,, 


V • W = 0,  (19) 

and,  hence,  the  solutions  for  W and  v must,  in  some  suitable  fashion,  be 
projected  onto  the  space  of  solenoidal  vectors. 

This  vorticity-velocity  formulation  of  fluid  dynamics  represented  by 
equations  (15)  - (18)  has  the  striking  property  that  non-inertial  effects  only 
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enter  into  the  solution  of  the  problem  through  the  implementation  of  initial 
and  boundary  conditions.  Consequently,  the  basic  structure  of  the  numerical 
algorithm  (i.e.,  the  numerical  formulation  of  (15)  - (16))  will  be  independent 
of  whether  or  not  the  frame  of  reference  is  inertial — a situation  which 
greatly  enhances  the  general  applicability  of  any  Navier-Stokes  computer  code 
which  is  developed  based  on  this  approach. 

At  this  point,  a few  comments  should  be  made  concerning  the  alternate 
ways  in  which  the  velocity  field  can  be  calculated  in  the  vorticity-velocity 
formulation.  Instead  of  solving  the  Poisson  equation  (16),  it  is  possible  to 
solve  the  defining  equation  for  vorticity  directly,  i.e., 

VxV  = u = W-2ft,  (20) 

r+sj  rj 

(see  Gatski,  Grosch,  and  Rose  [6,8]).  Of  course,  for  plane  or  axisymraetrlc 
flows,  there  exists  a stream  function  iJj  such  that  [7] 


z< 

II 

?>" 

X 

1<1 

-e- 

(21) 

V X (x  X V\|0  = W - 2ft, 

(V  /v  rsj'  rsj  J 

(22) 

where  X ■ Vx  and  x is  the  coordinate  that  the  flow  is  independent  of  (for 

o 

plane  flows,  (22)  reduces  to  the  Poisson  equation  V rjj  = W - 2ft).  While  the 
motion  of  the  frame  of  reference  does  enter  into  the  equations  of  motion  in 
these  alternate  vorticity-velocity  formulations,  it  does  so  in  a much  less 
significant  way  than  in  the  pressure-velocity  formulation.  To  be  specific, 
the  transport  equation  which  is  solved  (i.e.,  equation  (15))  does  not  contain 
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any  frame-dependent  terms  and,  at  each  time  step,  the  partial  differential 
equation  for  the  determination  of  the  velocity  field  is  only  altered  by  the 
addition  of  a constant  forcing  function  in  the  form  of  2ft  (the  added  term  on 
the  right-hand  side  of  (20)  and  (22)). 

Finally,  it  would  be  of  value  to  mention  some  other  advantages  of  the 
vorticity-velocity  formulation.  More  difficulties  have  been  known  to  arise  in 
the  implementation  of  pressure  boundary  conditions  than  vorticity  boundary 
conditions  [1,2]  (of  course,  both  boundary  conditions  must  usually  be 
derived).  Difficulties  in  satisfying  the  continuity  equation  in  the  pressure- 
velocity  formulation  have  also  been  known  to  give  rise  to  numerical 
instabilities  [1].  Furthermore,  in  the  vorticity-velocity  approach,  the 
vorticity  vector  is  calculated  directly.  This  is  of  considerable  value  since 
the  vorticity  field  can  play  an  important  role  in  characterizing  certain 
features  of  turbulence  [9].  While  it  is  certainly  not  being  suggested  that 
the  pressure-velocity  formulation  be  abandoned,  this  study  does  indicate  that 
the  vorticity-velocity  formulation  can  have  distinct  advantages  when  applied 
to  an  important  class  of  viscous  flows. 
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